Spatial Decomposition of Inequality - unu-wider

7 downloads 0 Views 151KB Size Report
Income inequality is captured by an inequality index I(y) which is assumed ...... 98.7 rural province. Elbers et al. (2002). 195. 1994 na. 5.9. 94.1 rural Canton. 915.
Discussion Paper No. 2004/01 Spatial Decomposition of Inequality

Anthony Shorrocks and Guanghua Wan* January 2004

Abstract This paper reviews the theory and application of decomposition techniques in the context of spatial inequality. It establishes some new theoretical results with potentially wide applicability, and examines empirical evidence drawn from a large number of countries.

Keywords: inequality, index, decomposition JEL classification: C43, D31, D63, R12

Copyright

¤ UNU-WIDER 2004

* UNU-WIDER, Helsinki. This is a revised version of a paper presented at the UNU-WIDER Conference Inequality, Poverty and Human Well-being, May 2003 in Helsinki. It has been prepared within the UNU-WIDER project on Spatial Disparities in Human Development, directed by Ravi Kanbur and Tony Venables, with Guanghua Wan. UNU-WIDER gratefully acknowledges the financial contributions to the 2002-2003 research programme by the governments of Denmark (Royal Ministry of Foreign Affairs), Finland (Ministry for Foreign Affairs), Norway (Royal Ministry of Foreign Affairs), Sweden (Swedish International Development Cooperation Agency—Sida) and the United Kingdom (Department for International Development).

The World Institute for Development Economics Research (WIDER) was established by the United Nations University (UNU) as its first research and training centre and started work in Helsinki, Finland in 1985. The Institute undertakes applied research and policy analysis on structural changes affecting the developing and transitional economies, provides a forum for the advocacy of policies leading to robust, equitable and environmentally sustainable growth, and promotes capacity strengthening and training in the field of economic and social policy-making. Work is carried out by staff researchers and visiting scholars in Helsinki and through networks of collaborating scholars and institutions around the world.

www.wider.unu.edu

[email protected]

UNU World Institute for Development Economics Research (UNU-WIDER) Katajanokanlaituri 6 B, 00160 Helsinki, Finland

The views expressed in this publication are those of the author(s). Publication does not imply endorsement by the Institute or the United Nations University, nor by the programme/project sponsors, of any of the views expressed. ISSN 1609-5774 ISBN 92-9190-568-2 (printed publication) ISBN 92-9190-569-0 (internet publication)

1. Introduction Spatial disparities in living standards have been the subject of a great deal of attention in recent years. At the global level there has been concern at the prospect of rising inequality in the world distribution of income and the extent to which this is fuelled by factors linked to globalization: see, for example, Milanovic (2002); Sala-i-Martin (2002); Bourguignon and Morrison (2002); Fischer (2003); Kremer and Maskin (2003). Similar concerns surface within individual countries, especially those countries where income inequality has been rising over time and where average incomes vary considerably across regions or provinces. In China, for example, unease with the growing disparity between the living standards in the coastal areas and the inland regions has prompted the Chinese government to launch a campaign to develop the western regions (Kanbur and Zhang 2003). The problem becomes a more intense political issue when spatial inequality is perceived to be related to discrimination against particular groups of citizens such as rural farmers (compared to urban residents), ethnic minorities concentrated in remote areas, migrants in certain districts, or religious groups in particular regions (eg. Muslims in Xinjiang Region in China). The region of residence is not, of course, the only factor which accounts for differences in living standards: there are typically wide disparities in incomes within, as well as between, regions. Therefore, in order to appreciate the significance of geographical location it is necessary to have a method of separating out the contribution of the spatial factors. Many empirical studies of living standards make use of regional dummy variables, but the implications for the aggregate level or trend in inequality are rarely explored — at least in terms of the inequality measures commonly employed elsewhere. The principal alternative procedure begins with an inequality value for the whole population which is broken down or ‘decomposed’ into contributions associated with different spatial dimensions. Typically the aggregate sample data are partitioned into a set of geographical regions or districts, and the data analysed in terms of the inequality observed within each of the regions, the inequality due to variations in average incomes across regions, and, in some cases, the inequality attributable to ‘interactions’ or ‘overlaps’ between the regional income distributions. Similar decomposition procedures are routinely applied to partitions into population subgroups defined according to a wide variety of other criteria, including gender, age, education level, and 1

so on. The aim of this paper is to review the current state of knowledge regarding inequality decomposition in a spatial context although, for the reasons explained above, the broad thrust of the comments also apply to many other types of decomposition by population subgroups. We begin in section 2 with a discussion of the foundations of the methodology for inequality decomposition. Section 3 outlines a number of general results on spatial decomposition which should be helpful in interpreting and assessing empirical evidence on the subject. Empirical evidence drawn from a number of country studies is then reviewed in the light of the theoretical insights. Section 5 concludes the paper.

2. Theoretical Foundations of Spatial Decomposition of Inequality The analysis of spatial inequality typically begins with a measure of living standards or resources defined for a population of individuals or households. We follow common practice in referring to the measure of living standards as ‘income’, although it should be stressed at the outset that the income concept must be interpreted broadly to encompass not only home production and non-pecuniary income, but also all the advantages and disadvantages systematically associated with geographical location, including climate, regional price variations, local public good provision and environmental quality. In essence, the analysis assumes that individuals with the same income at different locations are equally well-off. The formalities below are framed in terms of a (homogeneous, equally weighted) population of individuals represented by N = {1, 2, ... , n}, with incomes given by the vector y = ( y1 , ... , yn ) and mean income denoted by µ. Income inequality is captured by an inequality index I(y) which is assumed throughout to satisfy the following five basic properties:1

1

These are all standard properties of measures of relative inequality. For more details see Silber (1999). 2

(A1) symmetry (or anonymity); (A2) the Pigou-Dalton principle of transfers (or strict Schur convexity): a mean-preserving progressive transfer reduces inequality;2 (A3) scale invariance (or homogeneity of degree zero); (A4) replication invariance; and (A5) zero normalisation: the minimum value of I is zero (achieved when all incomes are identical).

The decomposition of inequality according to a partition of the aggregate population into geographical regions (or, more generally, into any set of mutually exclusive and exhaustive subgroups) is most often undertaken with one of the entropy indices popularised by Theil (1967, 1972) and later explored in more detail by Bourguignon (1979), Shorrocks (1980, 1984, 1988), Cowell and Kuga (1981), and Foster and Shneyerov (2000), amongst others. The single parameter entropy family may be written: (1a) (1b) (1c)

Ec(y) 

yi c

1

M  i µ p  1" ,

c (c  1)n

yi

E1(y) 

1 n

i N

E0(y) 

1

M ln

n

M

c  0, 1,

i N

µ

i N

yi

ln µ yi

µ

,

,

or in the more condensed form: (2)

Ec (y) 

1 n

M φc ( y i / µ ) ,

i N

where φc ( t )  ( t c  1 ) / [ c ( c  1 )] , c  0, 1; φ1 ( t )  t ln t ; and φ0 ( t )   ln t . Special cases include the Theil coefficient (corresponding to c = 1), the mean logarithmic deviation (c = 0), and one half of the square of the coefficient of variation (c = 2). The decomposition properties of this class of measures are best illustrated by considering the

2

Note that (A2) extends to situations in which frequencies f1, f2, . . . , fn are attached to the income levels y1, y2, . . . , yn . Successive application of the principle of transfers implies that inequality will be reduced by any equalisation of two income levels which preserves the overall (weighted) mean income. 3

index E0 and by supposing that the set of individuals, N, is partitioned into m proper subgroups Nk (k = 1, 2, ..., m), with respective income vectors yk, mean incomes µ k, population sizes nk, and population shares νk = nk / n. It will also be convenient to let y k denote the distribution obtained by replacing each income in the vector yk with the subgroup mean µ k. Then m

E0( y) = E0 ( y1, y2, . . . , y m ) =

(3)

m

= M

k1

nk 1 n nk

M ln

i Nk

µk yi

m

m

k1

k1



1 n 1 n

= M νk E0( y k)  M νk ln

µ

M M ln

yi

M M ln

µk

k  1 i Nk m

k  1 i Nk µ µk

µ

= W+B

where m

(4a)

W = M νk E0( y k) k1

is a weighted average of subgroup inequality values, traditionally referred to as the ‘withingroup’ component of inequality; and m

(4b)

B = M νk ln k1

µ µk

= E0 ( y 1, y 2, . . . , y m)

is the ‘between-group’ contribution to inequality, representing the level of inequality obtained by replacing the income of each person with the mean income of their respective subgroup. Thus — for the index E0 at least — the overall level of inequality for a country can be expressed in an intuitively appealing fashion as an exact sum of the average inequality within regions and the inequality due purely to differences in average incomes between regions. To appreciate the special attraction of the decomposition indicated by (3), it may be noted that for any inequality index I(.) which satisfies properties (A1)-(A5), the aggregate level of inequality may be written (5)

I ( y1, y2, . . . , y m )  I ( w1 b1, w2 b2, . . . , w mbm )  Iˆ( w1, w2, . . . , w m, b ) ,

where wk = yk/µ k indicates the vector of relative incomes within region k, and b = ( b 1, b 2, . . . , b m ) , bk = µ k/µ, denotes the vector of relative mean incomes across regions. Equation (5) makes it clear that aggregate inequality in a country is completely accounted for by

4

differences in relative income within regions (as captured in the vectors wk ) and differences in relative mean income between regions (as captured by b). In this context, it is natural to regard the inequality contribution of wk as the amount by which aggregate inequality falls if relative incomes in region k are equalised, ceteris paribus; and the contribution of b as the amount by which aggregate inequality falls if regional mean income differences are eliminated, holding constant relative incomes within regions (ie by proportionately scaling incomes within each region until the regional mean matches the population value). For the index E0, the within and components (4a) and (4b) conform to these interpretations. Furthermore, the values of the contributions are invariant to the order in which within- and between-group differences are eliminated. Other inequality indices drawn from the entropy family (1) satisfy a similar decomposition equation given by m

(6)

c

m

Ec( y) = Ec ( y1, y2, . . . , y m ) = M νk bk Ec( y k)  M νk φc ( b k ) = W + B, k1

k1

which again leads to a natural interpretation in terms of the within- and between-group contributions to inequality: m

(7a)

c

W = M νk b k Ec( y k) ; k1 m

(7b)

B = M νk φc ( b k ) = Ec ( y 1, y 2, . . . , y m) . k1

However, the decomposition provided by (6) is less satisfactory than that given by (3) for two reasons. First, while the within-region term remains a weighted sum of regional inequality values, the weights typically do not sum to one, unless c = 0 or 1. (The latter corresponds to the Theil coefficient, where the weights are given by the regional income shares). So it is usually wrong to interpret W as the average level of inequality within regions. Secondly, the within-group component now depends both on within-group differences and (via the weights) on betweengroup differences. So any attempt to eliminate between-group variation along the lines suggested following equation (5) now has an indirect effect on the value of the within-group term. As a consequence the quantitative impact of eliminating within- and between-group variations is now

5

sensitive to the order in which the factors are considered.3 The main appeal of the decomposition provided by (6) rests on the fact that the inequality indices are subgroup consistent in the following sense: holding regional mean incomes and population sizes fixed, an increase in inequality within each region must lead to an increase (or, at least, not a decrease) in inequality in the country as a whole. This property is evidently true for the entropy measures, since the ceteris paribus clause implies that the between-group term B is constant in (6), and that the rise in regional inequality translates into a rise in the weighted sum of regional inequality values captured by the within-group component W. Subgroup consistency is an intuitively appealing and relatively weak property satisfied by the Atkinson class of inequality measures as well as the Entropy family.4 However the Gini coefficient is not subgroup consistent, and therefore not amenable in general to a decomposition along the lines of (6). This has not discouraged many researchers from attempting to decompose the Gini index in specific contexts or using different principles. The method which most closely resembles (6) can be formulated by numbering the regions in order of increasing mean incomes, and by supposing that person i occurs in the ith position when the income distribution is written y  ( y1 , y2 , ... , y m) , and in position ri when all incomes are arranged in increasing order. The value of the Gini coefficient is then given by (8)

G (y) 

2 n 2µ

M r i ( yi  µ)

i N

and yields the decomposition equation (see, for example, Lambert and Aronson 1993)

3

This special property of E0 corresponds to the ‘path independence’ property discussed by Foster and Shneyerov (2000). They observe that there are two ways of deriving W and B. As discussed earlier, the first obtains the between-group B contribution as the level of inequality which results after within-group inequality is eliminated by redistributing incomes equally within each region. W is then taken as the residual. The second defines W to be the level of inequality which results when inequalities between groups are eliminated by proportionally scaling each subgroup distribution until it has the same mean as the overall distribution, with the residual now taken to be B. The decomposition is said to be path-independent if these two methods produce identical results. 4

See Shorrocks (1988) for a more detailed discussion. 6

G  G (y1, y2 ... , y m)  (9)



2 n 2µ

2 n 2µ

m

M M ri ( yi  µ)

k  1 i Nk

m

M  M i ( yi  µ k)  M i ( µ k  µ)  M ( ri  i ) yi # i N i N i N

k1

k

k

k

 W  B  R, where (10a)

W 

2 n 2µ

m

m

2

k M M i ( yi  µ k)  M νk b k G (y )

k  1 i Nk

k1

is a weighted sum of the within-group inequality values, and (10b)

B 

2 2

n µ

m

m

k

m

k1

j1

jk

1 2 m M M i ( µ k  µ)  M bk νk w M νj  M νj ~ = G ( y , y , . . . , y )

k  1 i Nk

is the ‘between-group component’, representing the value of the Gini coefficient when the income of each individual is replaced by the mean income of the subgroup to which they belong. The final term, R, in equation (9) is a residual or ‘interaction’ effect which vanishes when the regional income ranges do not overlap (so that ri  i , for all i), and is otherwise strictly positive. When the regional income ranges are non-overlapping there is a very clear correspondence between the Gini decomposition (9) and the Entropy formulation (6); the only substantive c

2

difference is that the regional inequality weights are given by νk bk in (6) and by νk bk in (9). In this case also, it is natural to regard the ratio B/G as a measure of the proportional contribution of regional income variation to total inequality, mimicking the analogous expression B/Ec used in the context of Entropy indices. The situation becomes more problematic when the regional income ranges overlap, because the interaction term introduces a third, poorly specified, element into the decomposition equation (9). It is also important to note that (9) is not the only form of Gini decomposition on offer. Many other specifications have been suggested. The more recent proposals typically retain the division into within-, between-, and interaction terms, but differ in the formulae used for each of the components.5 In the absence of an obviously superior alternative, we proceed on the assumption that the between-group term B defined in (10b)

5

For early contributions, see Soltow (1960), Bhattacharya and Mahalanobis (1967), Rao (1969), Mangahas (1975) and Pyatt (1976). More recent developments include Silber (1989), Yitzhaki and Lerman (1991), Yitzhaki (1994) and Sastry and Kelkar (1994). 7

expressed as a proportion of the overall Gini value captures what we mean by the importance of the contribution of average income variation by region to total inequality as measured by the Gini coefficient.6 Another departure from the traditional decomposition framework explores the implications of generalising the notion of average regional income to measures other than the mean. The idea dates back to Blackorby et al. (1981) but has recently been explored in greater detail by Foster and Schneyerov (2000). In this framework the between-group term is constructed by replacing the income of each individual with a suitably defined representative income level for the region. Employing representative income levels other than the mean expands the set of inequality indices that have simple and attractive decomposition properties. This opens up some interesting lines for future research, but since the empirical applications have not yet seen the light of day they are not pursued further in this paper.

3. Some General Theoretical Results Despite the widespread use of decomposition techniques, little attention has been given in the past to establishing general decomposition results. The range of indices considered in the last section, combined with the possibility of alternative decomposition specifications, may have made it seem difficult to draw general conclusions about the way in which spatial factors impact on inequality. This turns out to be unduly pessimistic. The results below apply to any inequality index I() which satisfies (A1)-(A5), and document the properties of a ‘between-group term’ B defined by (11)

B = I ( y 1, y 2, . . . , y m) ,

in accordance with (4b), (7) and (10b) above. Consider first the range of values for B and how the value compares with the overall inequality

6

Note however that eliminating between-group inequality by scaling incomes within each subgroup until they match the population average will in general not only affect the within group term (as with most Entropy indices) but also the interaction component. 8

value I = I ( y1, y2, . . . , y m ) . Intuitively, when only one group is identified (i.e. m = 1) then average incomes do not vary across regions and B must be zero. At the other end of the scale, if the number of regions is the same as the size of the population (i.e. m = n), then each region contains a single observation and B must equal I. It is also reasonable to expect that these two cases represent the minimum and maximum values that B can take, so that: Proposition 1: (a) 0  B  I; (b) B = 0 if m = 1; (c) B = I if m = n. To establish Proposition 1, note that (b) and (c) are both immediate consequences of the definition of B given by (11), because B is the inequality value obtained by replacing the income of each person by their corresponding regional mean. In addition, we have B  0 because the index I(.) is always non-negative (by A5); and B  I because the ‘regionally equalised’ distribution ( y 1, y 2, . . . , y m) is obtained from the original distribution ( y1, y2, . . . , y m ) by applying an equalizing (and hence inequality reducing, by A2) procedure to each region in turn. The argument in support of Proposition 1 also suggests a non-decreasing relationship between the number of regions and the magnitude of the between-group term. An increase in the number of regions will increase the opportunities for differentiating between the regional mean values used in the calculation of B, thereby causing the value of B to rise. This is most easily seen by reversing the question and asking about the consequences of reducing the number of regions via a merger between two regions. The impact on the value of B is equivalent to that of a meanpreserving equalization of the two subgroup income levels which, by the principle of transfers (A2), cannot increase the value of B. Hence: Proposition 2: The value of B does not increase if any two regions are combined.7 The ‘finer partition’ characterisation in Proposition 2 is one way of capturing the idea that B increases monotonically with m. Another possible interpretation is that the between-group term is larger on average when more regions are identified, in other words: Proposition 3: The expected value of B increases with m.8

7

Note that successive application of Proposition 2 allows Proposition 1(a) to be derived from Proposition 1(b) and 1(c). 8

We are indebted to Ravi Kanbur for suggesting that this result may hold. 9

This proposal is still not well formulated, since it is not clear how the expectation is to be taken over the space of partitions and over the allocation of individuals to subgroups. For example, each of the partitions into m regions may be treated as equally likely, or they may be assigned a probability corresponding to the likelihood that this partition is observed when n individuals are randomly distributed across m categories. While a formal proof of Proposition 3 is beyond the scope of this paper, intuition suggests that the result must hold under a variety of interpretations for the following reason. For a fixed-size population, an increase in the number of regions causes the average size to fall, so the distribution shifts towards smaller sized regions. But, as the mean value of smaller samples exhibits greater variability, the net effect is an increase in the expected inequality value captured in the between-group term (11). The shift towards smaller sized classes can be formalised when n individuals are randomly allocated across m regions, each containing at least one person. The probability that a region contains r + 1 members (r  0) is then given by the multinomial value πr1(m) 

(12)

(nm)! 1 m1 nmr ( )r( ) , r  0, . . . , nm, r! (nmr)! m m

from which it follows that (13)

πr2(m) / πr1(m) 

nmr 1 nr1 1    πr2(m1) / πr1(m1). r1 m1 (r1)(m1) r1

In other words, the frequency of larger regions falls off faster as the number of regions increases. According to Proposition B.1 of Marshall and Olkin (1979, p.129), condition (13) ensures that π(m) is majorised by (ie Lorenz dominates) π(m+1) for all m, so that

(14)

s

s

r0

r0

M πr(m)  M πr(m1) , for all m and all s .

This is the formal sense in which the distribution of regions shifts towards smaller sizes as the number of regions increases. A similar condition is likely to hold when alternative methods are used to allocate a given population of individuals to a given number of groups. 10

Let us now fix the partition level (m), the sizes of regions {n1, ... , nm}, and the overall income distribution y = ( y1 , ... , yn ) , and consider what can be said about the way in which B depends on the allocation of individuals (and hence incomes) across the regions. The following two observations follow immediately from the definition (11) of the between-group term. Proposition 4: (a) if the distribution of income is the same in each region then B = 0; (b) if the regional mean incomes are all equal then B = 0. Note that the prerequisite in part (b) of Proposition 4 is significantly weaker than the corresponding requirement in part (a). Proposition 4(a) refers to the situation in which the subgroup distributions overlap to the greatest possible extent. It seems plausible to suppose that a reduction in the degree of overlap will translate into a smaller between-group term, but the precise relationship is difficult to formalise given that a reduction in overlap between two subgroups may not necessarily cause the subgroup means to move apart. At the other end of the scale, however, it is possible to establish that if the regional income ranges (strictly) overlap then the between-group term is not a maximum, and hence: Proposition 5: B is a maximum only if the regional income ranges do not overlap. The argument is as follows. Suppose that regions k and  have strictly overlapping distributions and that µ k  µ # Choose i  Nk and j  N# such that yi > yj. Then swapping the incomes yi and yj between the two regions raises the mean income in the ‘more affluent’ region , so the switch corresponds to a regressive Pigon-Dalton transfer between the two subgroup income levels which, by appeal to (A2), must increase the inequality value represented by B. Note that disjoint income intervals is a necessary condition for B to be a maximum, but it is not sufficient unless all subgroups have equal size. In other cases, it is necessary to consider how the different sized groups should be positioned in the income range. Drawing on the lessons of the similar exercise in Davies and Shorrocks (1989), it seems likely that the larger groups will be positioned at the centre of the income distribution, and that the subgroup sizes will decline

11

monotonically towards each tail.9

4. Empirical Evidence There is now a large empirical literature on inequality decomposition by population subgroups defined in terms of spatial location. The number of studies which report inequality decompositions using non-spatial elements (education, age, gender, etc.) is even greater. Given the focus of this paper, attention is confined here to spatial applications. But, as in the previous sections, the conclusions may well apply also in the non-spatial context. The type of questions we will attempt to address are as follows: Do any general patterns or conclusions emerge from the empirical literature? Is any empirical regularity observed in the relationship between the number of groups and the magnitude of the share of the between-group component? To what extent do the decomposition results depend on the measure of inequality? Are the results sensitive to the ‘income’ variable used in the analysis? Most empirical spatial decomposition studies use either the mean logarithmic deviation index E0 or the Theil Index E1. Tables 1A and 1B summarise the results obtained from applying the decomposition of E0 to many countries and points of time. Given the differences in sample size, choice of income variable, selection of regions, etc. reliable general conclusions are hard to draw. For this reason, it is safer to use the term ‘observation’ rather than finding or conclusion. Tables 1A and 1B here

Observation 1: The magnitude of the between-group component. As is typical of most subgroup decompositions, the between-group component is small relative to the within-group component except in the case of urban-rural divide (see Observation 2). This is particularly true when earnings data are used (see Observation 3).

9

Interestingly, Davies and Shorrocks (1989) show that the between-group component in the Gini decomposition can closely approximate total inequality with a relatively small number of subgroups, as long as the subgroup income ranges are non-overlapping and the group sizes are chosen optimally. 12

Excluding these two sets of circumstances, the share of the within-group component averages 12% with a minimum of 0% and a maximum of 51%. Some have concluded from this type of evidence that space or location is a relatively unimportant explanation of inequality (see, for example, Cowell and Jenkins (1995) ). However, before this conclusion is drawn, it should be noted that, as a determinant of inequality, space is poorly defined. Spatial location is often not of interest itself, but rather because of its association with many other important influences such as natural resources, weather conditions, cultural traditions, and even institutional arrangements. While some of these factors may contribute positively to the between-group component, uniform institutional arrangements such as nationwide policies are likely to make a negative contribution. Current procedures assign all of these factors to location without trying to disentangle the associated influences. The estimated between-group component cannot therefore be taken as a measure of the spatial contribution unless and until the definition of space is clarified. Furthermore, caution needs to be exercised when drawing policy implications from the empirical evidence. As noted by Kanbur (2002), if space is related to race or ethnicity, a small between-group component may not accurately reflect the significance of space as a determinant of inequality. Observation 2: The rural-urban divide. The rural-urban division seems always to produce a much larger between-group component. It ranges from 9% for Greece to as much as 78% for China. This latter result was obtained using regional level rather than household level data for China, and is therefore not strictly comparable. However, household-level data for China still yields a between-group component share of almost 38% (Lee 2000). Overall, the studies applying a rural-urban split to household data yield an average between-group component of 19.6%, almost 8% higher than the average reported in Observation 1 above. The between-group component depends on both the number of subgroups and differences in group means (or representative group values). Empirical evidence suggests that differences in means are the more important of these two factors, because the rural-urban distinction employs the minimum number of subgroups. Other spatially defined decompositions often involve a much larger number of groups but produces a smaller within-group component, as is evident from the Table 1A data for China, Indonesia and the Philippines. 13

What is the reason for the relatively large between-group component for the rural-urban divide? As mentioned earlier, the cause may well lie in the inability of current decomposition techniques to control for other variables. Lower prices and/or availability of home produced food in rural areas may not be fully reflected in the data on living standards. Furthermore, the ruralurban divide in developing countries is often associated with other differences linked to the provision of infrastructure, the development of capital markets, education, health care, and so on. Controlling for these effects is likely to lead to smaller between-group components for the pure spatial effect in the context of the rural-urban divide. In China, the rural-urban separation has been largely institutionalized. This separation results in dramatic differences in employment opportunities, education, infrastructure, health care, and access to capital and technology. Further the separation also causes different returns to these factors, because the markets are not integrated. Added together, it is not difficult to understand the reasons for the very large share of the between-group component in China. Observation 3: Alternative income concepts The data in Table 1A refer to income or consumption. Table 1B reports similar data for earnings in one country (the UK). The percentage share of the between-group component turns out to be much lower, ranging from 1% to 12%, with an average of 4.8%. Interestingly, total inequality in earnings is not smaller than inequality of income or consumption, which suggests that there is considerable wage differentials across occupations or sectors, but relatively little variation in occupational wages across locations. While it is dangerous to extrapolate from data from a single country, the same result may apply to other market economies where there are no constraints to migration, and where returns to labour and human capital are more or less equalized across space. Collective bargaining, the strength of labour unions and national wage setting policies may also be influential. Equal factor returns are not sufficient, of course, to produce a negligible between-group component in decompositions of earnings inequality: the employee structure of the workforce must also be similar across space. It would therefore be useful to decompose the within-group component further into a ‘returns’ effect and a ‘workforce structure’ effect, the former reflecting market development and migration, and the latter reflecting industrial structure.

14

Although the distribution of consumption is known to be more equal than income, this does not appear to carry over to the proportional contribution of the between-group component of income inequality. For example the between-group component is relatively small for the income observations from Finland and Switzerland. Observation 4: Alternative measures of inequality A number of empirical studies report decomposition results based on different inequality indices, enabling us to compare the percentage share of the within-group component across indices. The correlation coefficients are presented in Table 2. The correlation amongst the various Entropy measures tends to be quite high; the correlation with the Gini values are somewhat lower. Overall, Table 2 suggests that the results obtained using one index should broadly carry over to other indices. Table 2 here

Observation 5: Country coverage. Although spatial decompositions exist for the UK, USA, and some other developed countries, results on regional inequality are dominated by developing country evidence. The limited number of studies for developed countries does not imply that spatial inequality is not of interest in the developed world. However, the greater attention in developing or transition economies may reflect the fact that weak market forces, or restrictions on factor mobility, prevent returns to income generating factors from converging. In the search for explanations for the existence of spatial inequality, it may be useful compare the values of the between-group component obtained for developed and developing countries. Observation 6: Spatial price variations. The majority of empirical studies reported in Table 1 do not adjust for spatial price differences, although such differences exist and may substantially change the results for both developing countries such as China (see Wan 2001) and free market economies such as the US (see Ram 1992). Price levels are often correlated with living standards, so adjusting for spatial price differences will tend to lower the between-group term in the spatial decomposition while 15

not altering inequality within regions (although the within-group component may be affected in an unpredictable fashion due to a change in the weights). Thus, despite its relatively small value (see Observation 1 above), the reported share of the between-group component is likely to be exaggerated, particularly in countries with a big land mass and underdeveloped markets. Observation 7: The number of subgroups and the size of the between-group component. As discussed in Section 3, the between-group component is expected to rise as the number of groups increases. To examine the empirical evidence, Figures 1-4 present scatter plots of the share of the between-group component against the number of subgroups.10 The graphs do not show any obvious positive relationship; if anything, the reverse appears to be the case. This apparent conflict with the theoretical predictions is not completely surprising, however, because other factors are not held constant. In particular, comparability is compromised if different criteria are used to group the sample observations. This is easily detected in the results for China, Indonesia and the Philippines. In the relevant studies, the samples were divided into urban-rural areas as well as into regions. In doing so, the number of subgroups increases from 2 to 26 or 27 for Indonesia (2 to 13 for the Philippines, 2 to 3 or 26 for China), but the between-groupcomponent falls in most cases. This clearly indicates the dominant impact of differences in living standards between rural and urban residents, which more than offsets the contribution of the number of subgroups. Figures 1 - 4 here

To examine properly the positive relationship between the number of subgroups and the size of the between-group component requires progressive aggregation of subgroups. This has been done by Elbers et al. (2002) for Ecuador, Madagascar and Mozambique11 and by Cheng (1996) for China. Results from Elbers et al. indicate small increases in the between-group component, even if the number of groups increases dramatically. Using consumption data, Cheng (1996) reports a rise in the between-group component from 28% to 37% when the number of groups

10

To assist the visual presentation, observations with more than 27 subgroups are excluded: these will be discussed later. Also excluded are the results for the rural-urban divide in China, as they are treated as outliers. 11

These observations were excluded from Figures 1-4. 16

increases from 3 to 26. Using data on the gross value of industrial and agricultural outputs (GVIAO ), the corresponding change is from 39% to 51%. To explore further the relationship between the number of subgroups and the share of the between-group component, we employed the regression model: S = f(m, D), where S is the share of the between-group component, m is the number of subgroups, and D refers to a set of dummy variables to control for different income concepts, different inequality measures, and the rural-urban division versus other spatial partitions. To allow for possible nonlinearities, the model specification takes the Box-Cox form. The standard linear model always produced an insignificant parameter for the core variable m, but a simple χ2 test suggested preference for the Box-Cox model. Estimation results with the Box-Cox model are reported In Table 3, with E0 as the reference index. The results indicate that (a) the size of the between-group component is positively related to the number of subgroups at any conventional level of statistical significance; (b) increasing the number of groups by one leads on average to an increase of 0.07 in the percentage share of the between-group component; (c) earnings data tends to yield a smaller between-group component, (d) the urban-rural partition gives a larger between-group component; and (e) the Gini coefficient produces larger shares for the between-group component compared to other indices. Table 3 here

In summary, this section has attempted to present empirical results that may help future research, both empirical and theoretical. Many questions have been raised which require attention from theorists and empirical researchers. Of particular importance are the appropriate measure of spatial proximity; the relationship between the number of groups and the between-group component; the use of subgroup means or alternative measures of representative incomes; and the choice of inequality index.

17

5. Concluding remarks This paper has ranged over a number of theoretical and empirical issues linked to decomposition analysis in a spatial context. Various other issues have yet to be explored. For example, with the exception of Kanbur and Zhang (1999), there is little in the way of empirical literature on the time profile of the within or between-group component. Availability of data is an obstacle here. Nevertheless, a time profile would enrich the empirical literature by adding a dynamic dimension to the studies of spatial inequality decomposition. More could be done also to link inequality decomposition to the recent extensive literature on growth. In particular, examination of the pattern of the between-group component may be a better way of studying convergence than the commonly-used sigma convergence approach. Another set of issues requiring attention concern the underlying factors which ultimately contribute to spatial inequality, factors like economic geography (climate, natural resources), policy regimes, market orientation, and related socio-economic variables. Whether or not spatial differences persist or whither away over time is perhaps influenced most by the freedom to migrate, both within countries and internationally. The extent to which labour migration can help reduce regional disparities is an important question with obvious policy significance.

18

References

Akita, T., R. Lukman, A. Rizal and Y. Yamada: Inequality in the Distribution of Household Expenditures in Indonesia: A Theil Decomposition Analysis, Developing Economies. June 1999; 37(2): 197-221. Balisacan, A. and N. Fuwa: Is Spatial Inequality Increasing in the Philippines, Paper presented at the WIDER Conference on Inequality in Asia, 28-29 March 2003, Tokyo. Blackorby, C., D. Donaldson and M. Auersperg: A New Procedure for the Measurement of Inequality Within and Among Population Subgroups, Canadian Journal of Economics. 1981; 14: 665-685. Bourguignon, F. and C. Morrison: Inequality among World Citizens: 1820-1992, American Economic Review. 2002; 92(4): 727-744. Cheng, Y.: A Decomposition Analysis of Income Inequality of Chinese Rural Households, China Economic Review. Fall 1996; 7(2): 155-67 Cowell, F.A. and S.P. Jenkins: How Much Inequality Can We Explain? A Methodology and an Application to the United States, Economic Journal. March 1995; 105(429): 421-30 Davies, J. B., and A. F. Shorrocks: Optimal grouping of income and wealth data, Journal of Econometrics, 1989; 42, 97-108. Deutsch, J. and J. Silber: Inequality Decomposition by Population Subgroups and the Analysis of Interdistributional Inequality, in J. Silber, ed. Handbook of income inequality measurement. Boston; Dordrecht and London: Kluwer Academic, 1999; 363-97. Dickey, H.: Regional Earnings Inequality in Great Britain: A Decomposition Analysis, Regional Studies. October 2001; 35(7): 605-12. Elbers, C., P. Lanjouw, J. Mistiase, B. Ozler. and K. Simler: Are Neighbours Unequal? Estimating Local Inequality in Three Developing Countries, Paper presented at the WIDERCornell-LSE Conference on Spatial Inequality and Development, LSE, 28-30 June 2002. Ernst, C., M. Gerfin and R. E. Leu: Inequality Trends in the Swiss Income Distribution, Swiss Journal of Economics and Statistics. September 2000; 136(3): 289-305. Fischer S.: Globalization and Its Challenges, American Economic Review. 2003; 93(2): 1-30. Foster, J.E. and A.A. Shneyerov: Path Independent Inequality Measures, Journal of Economic Theory. April 2000; 91(2): 199-222. Gray, D., J.A. Mills, and S. Zandvakili: New Perspectives on Canadian National and Provincial 19

Income Inequality, Paper presented at the UNU-WIDER Conference on Inequality, Poverty and Human Well-Being, Helsinki, 30-31 May 2003 Jenkins, S. P.: Accounting for Inequality Trends: Decomposition Analyses for the UK, 1971-86, Economica. February 1995; 62(245): 29-63 Kanbur, R.: Notes on the Policy Significance of Inequality Decompositions, unpublished paper. July, 2002. Kanbur, R. and X. Zhang: Which Regional Inequality? The Evolution of Rural-Urban and Inland-Coastal Inequality in China from 1983 to 1995, Journal of Comparative Economics. December 1999; 27(4): 686-701. Kanhur, R. and X. Zhang: Fifty Years of Regional Inequality in China: A Journey through Central Planning, Reform and Openness, Paper presented at the UNU-WIDER Conference on Spatial Inequality in Asia, Tokyo, 28-29 March 2003. Kremer, M. and E. Maskin: Globalization and Inequality, unpublished manuscript, Harvard University. 2003. Lambert, P.J. and J. R.Aronson: Inequality Decomposition Analysis and the Gini Coefficient Revisited, Economic Journal. September 1993; 103(420): 1221-27 Lee, J.: Changes in the Source of China's Regional Inequality, China Economic Review. 2000; 11(3): 232-45. Loikkanen, H. A., M. Riihelä and R. Sullström: Regional income convergence and inequality in boom and bust: Results from micro data in Finland 1971-1998, and especially during the 1990s. Paper prepared for the meeting of the ESRC Urban and Regional Economics Seminar Group, 1-3 July, 2002, University of Central Lancashire, UK. Mangahas, M.: Income Inequality in the Philippines: a Decomposition Analysis. World Employment Programme Working Paper No. 12, Geneva, ILO, 1975. Marshall, A.W. and I. Olkin (1979): Inequalities: Theory of Majorization and its Applications. New York: Academic Press. Mehran, F.:. Decomposition of the Gini Index, A Statistical Analysis of Income Inequality. Income Distribution and Employment Program Working Paper, Geneva, ILO, 1974. Meagher, G.A. and P. B. Dixon: Analyzing Income Distribution in Australia, Economic Record. December 1986; 62(179): 427-41. Milanovic, B.: True World Income Distribution, 1988 and 1993: First Calculation Based on Household Surveys Alone, Economic Journal, January 2002, 112, 51-92. Milanovic, B. and S. Yitzhaki: Decomposing World Income Distribution: Does the World Have 20

a Middle Class? Review of Income and Wealth. June 2002; 48(2): 155-78. Mishra, P. and A. Parikh: Household Consumer Expenditure Inequalities in India: A Decomposition Analysis, Review of Income and Wealth. June 1992; 38(2): 225-36. Mookherjee, D. and A. F. Shorrocks: A Decomposition Analysis of the Trend in UK Income Inequality, Economic Journal. December 1982; 92(368): 886-902 Parker, S. C.: The Inequality of Employment and Self-Employment Incomes: A Decomposition Analysis for the U.K., Review of Income and Wealth. June 1999; 45(2): 263-74 Pyatt, G.C.: On the Interpretation and Disaggregation of Gini Coefficients, Economic Journal. 1976; 86: 243-255. Ram, R.: Interstate Income Inequality in the United States: Measurement, Modelling and Some Characteristics, Review of Income and Wealth. March 1992; 38(1): 39-48. Sala-i-Martin, X.: The Disturbing “Rise” of Global Income Inequality, NBER Working Paper 8904, April 2002. Sastry, D. V. S. and U. R. Kelkar: Note on the Decomposition of Gini Inequality, Review of Economics and Statistics. August 1994; 76(3): 584-86 Shorrocks, A. F.: The Class of Additively Decomposable Inequality Measures, Econometrica. 1980; 48: 613-625. Shorrocks, A. F.: Inequality Decomposition by Population Subgroups, Econometrica. November 1984; 52(6): 1369-85 Shorrocks, A. F.: Aggregation Issues in Inequality Measurement, in Eichhorn, W. (ed.). Measurement in Economics, Physica-Verlag. 1988, 429-451. Silber, J.: Factor Components, Population Subgroups and the Computation of the Gini Index of Inequality, Review of Economics and Statistics. February 1989; 71(1): 107-15 Silber, J. (ed.): Handbook of income inequality measurement. Boston; Dordrecht and London: Kluwer Academic, 1999. Tadjoeddin M., W. Suharyo and S. Mishra: Aspiration To Inequality: Regional Disparity And Centre-Regional Conflicts In Indonesia, Paper presented at the WIDER Conference on Inequality in Asia, 28-29 March 2003, Tokyo. Theil, H.: Economics and Information Theory. Amsterdam: North Holland Publishing Co., 1967. Theil, H.: Statistical Decomposition Analysis. Amsterdam: North Holland Publishing Co., 1972. Tsakloglou, P.: Aspects of Inequality in Greece: Measurement, Decomposition and Intertemporal Change: 1974, 1982, Journal of Development Economics. February 1993; 40(1): 53-74 21

Vanderpuve, O. J.: Spatial Inequality and Polarization in Ghana, 1987-99. Paper presented at the WIDER Conference on Spatial Inequality in Africa, Oxford University, 21-22 September 2002. Wan, G.H.: Changes in Regional Inequality in Rural China: Decomposing the Gini Index by Income Sources, Australian Journal of Agricultural and Resource Economics. 2001; 45(3): 361-82. Yemtsov, R.: Quo Vadis: Inequality and Poverty Dynamics Across Russian Regions in 19942000, Paper presented at the WIDER-Cornell-LSE Conference on Spatial Inequality and Development, LSE, 28-30 June 2002. Yitzhaki, S. and R. I. Lerman: Income Stratification and Income Inequality, Review of Income and Wealth. September 1991; 37(3): 313-29.

22

Table 1A: Spatial decomposition of E0: income or expenditure Country Canada

China

China

China

China

China

China

No. of groups 9 9 9 9 9 9 9 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 3 26 2 3 26 4 4 4 4 4 5

Year 1991 1992 1993 1994 1995 1996 1997 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994

Total Between Within inequality % % 0.264 1.9 98.1 0.272 1.5 98.5 0.273 1.8 98.2 0.272 1.5 98.5 0.272 1.8 98.2 0.281 1.8 98.2 0.312 1.6 98.4 0.079 6.5 93.6 0.076 6.6 93.5 0.075 6.0 94.0 0.080 6.3 93.7 0.083 6.7 93.4 0.089 8.0 92.0 0.088 7.2 92.8 0.091 7.5 92.5 0.098 9.1 90.9 0.108 11.6 88.4 0.112 12.9 87.1 0.117 14.7 85.3 0.120 17.3 82.7 0.079 78.1 21.9 0.076 75.8 24.2 0.075 77.0 23.1 0.080 74.5 25.5 0.083 74.8 25.2 0.089 74.7 25.3 0.088 73.3 26.7 0.091 74.9 25.1 0.098 75.5 24.5 0.108 73.5 26.5 0.112 75.1 24.9 0.117 73.3 26.7 0.120 70.7 29.3 0.330 37.7 62.3 0.330 28.0 72.0 0.330 36.8 63.2 0.390 25.8 74.2 0.390 39.0 61.0 0.390 51.5 48.5 0.141 0.070 0.075 0.232 0.139 0.222

24.0 20.0 9.0 9.0 4.0 39.0

76.0 80.0 91.0 91.0 96.0 61.0

23

Category

Source

Remarks

province

Gray et al. (2003)

survey data on total household income

coast/inland

Kanbur and regional data on Zhang (1999) per capita consumption expenditure

urban/rural

Kanbur and regional data on Zhang (1999) per capita consumption expenditure

urban/rural zone region urban/rural zone region county within: Jilin Shandong Sichuan Guangdong Jiangxi province

Lee (2000)

1994 county/city data on per capita consumption

Lee (2000)

1994 county/city data on per capita GVIAO

Cheng Y. (1996)

1994 household survey data on per capita income

Ecuador

Ecuador

Finland

Ghana

Greece

India

Indonesia

Indonesia

Indonesia

Madagascar

Madagascar

Mozambique

3 21 195 915 5 19 87 664

1994 1994 1994 1994 1994 1994 1994 1994

na na na na na na na na

0.0 1.3 5.9 14.1 6.6 7.3 8.6 23.3

4 4 4 4 4 3 6

1971 1981 1990 1993 1998 1996 1996

0.127 0.076 0.069 0.075 0.104 0.269 0.269

12.5 6.3 7.6 7.3 4.4 29.6 29.1

2 2 9 9 17 17 17 17 17 17 2 2 2 27 27 27 26 26 26 26 26 26 6 104

1974 1982 1974 1982 1977 1983 1977 1983 1977 1983 1987 1990 1993 1987 1990 1993 1990 1993 1996 1998 1999 2002 1993 1993

0.196 0.160 0.196 0.159 0.277 0.182 0.214 0.164 0.219 0.180 0.228 0.223 0.239 0.232 0.227 0.243 0.223 0.239 0.216 0.172 0.190 0.233 N/A N/A

1117 6 103

1993 1993 1993

131 424 146 11

100.0 98.7 94.1 85.9 93.4 92.7 91.4 76.7

rural region rural province rural Canton rural parroquia urban region urban province urban Canton urban zonas

Elbers et al. (2002)

1994 estimated household data on per capita expenditure

Elbers et al. (2002)

1994 estimated household data on per capita expenditure

87.5 region 93.7 92.4 92.7 95.6 70.4 residence area 70.9 residence area

Loikkanen et al. (2002)

household data on per capita income

Jacqueline (2002)

10.1 9.3 12.4 8.7 5.0 5.3 10.9 8.2 1.8 3.4 22.3 22.0 25.2 15.1 15.0 17.3 13.0 15.0 21.0 22.0 21.0 15.0 4.8 15.4

89.9 90.7 87.6 91.3 95.0 94.7 89.1 91.8 98.2 96.6 77.7 78.0 74.8 84.9 85.0 82.7 87.0 85.0 79.0 78.0 79.0 85.0 95.2 84.6

Tsakloglou (1993)

household data on per capita consumption household data on per capita consumption

N/A N/A N/A

18.1 7.7 21.7

81.9 92.3 78.3

1993 1996

N/A N/A

23.2 22.0

76.8 78.0

1996 1996

N/A N/A

18.4 9.3

81.6 90.7

24

urban/rural region region rural region

Mishra and household data on Parikh (1992) per capita consumption

urban region urban/rural

Akita et al. (1999)

household data on per capita expenditure

province

Akita et al. (1999)

household data on per capita expenditure

province

Tadjoeddin (2003)

household data on per capita expenditure

rural faritany rural fivondrona rural firaisana urban faritany urban fivondrona urban firaisana administrative post district province

Elbers et al. (2002)

1993 estimated household data on per capita expenditure 1993 estimated household data on per capita expenditure estimated household data on per capita expenditure

Elbers et al. (2002)

Elbers et al. (2002)

Phillipines

Phillipines

Russia

Switzerland

2 2 2 2 2 13 13 13 13 13 77 77 77 77 77 77 3 3

1985 1988 1991 1994 1997 1985 1988 1991 1994 1997 1994 1995 1996 1997 1998 1999 1982 1992

0.282 0.264 0.306 0.260 0.303 0.282 0.264 0.306 0.260 0.303 0.297 0.282 0.316 0.337 0.314 0.329 0.136 0.159

17.2 16.6 16.3 15.6 17.5 15.4 13.0 17.6 13.5 15.1 25.0 27.0 26.0 23.0 28.0 31.0 0.2 0.6

82.8 urban/rural 83.4 83.7 84.4 82.5 84.6 region 87.0 82.4 86.5 84.9 75.0 region 73.0 74.0 77.0 72.0 69.0 99.8 99.4

Balisacan and family income and Fuwa (2003) expenditure survey data on per capita expenditure Balisacan and family income and Fuwa (2003) expenditure survey data on per capita expenditure Yemtsov (2002)

household budget survey data on per capita income

Ernst et al. (2000)

household data on per capita income

Table 1B: Spatial decomposition of E0: earnings Country UK

UK

UK

UK

No. of Year Total Between Within groups inequality % % 12 1979 0.260 1.0 99.0 12 1985 0.310 1.8 98.2 12 1991 0.320 2.5 97.5 12 1994/5 0.330 2.4 97.6 12 1979 1.850 2.5 97.5 12 1985 0.650 3.0 97.0 12 1991 0.780 9.8 90.2 12 1994/5 1.520 3.0 97.0 11 1975 0.095 3.2 96.8 11 1980 0.094 4.3 95.7 11 1991 0.133 6.8 93.2 11 1995 0.152 7.2 92.8 11 1991 0.213 12.2 87.8 11 1996 0.286 7.7 92.3

25

Category

Source

Remarks

region

Parker (1999) FES data on employee income

region

Parker (1999) FES data on selfemployment income

region

Dickey (2001) New Earnings Survey (individual) data

region

Dickey (2001) British Household Panel Survey (individual) data

Table 2. Correlation between shares of the between–group component

E0

E1

E2

Gini

0.98

0.83

0.65

0.98

0.64

E1 E2

0.75

Table 3. Estimation results Coefficient estimate

T-ratio

Significance level

0.13

3.65

0.00

0.17

E1

-0.31

-1.12

0.26

-0.03

E2

-0.39

-0.50

0.62

-0.00

3.51

8.20

0.00

0.13

-1.50

-5.09

0.00

-0.13

1.73

4.74

0.00

0.11

2.63

12.01

0.00

1.24

Variable number of groups

Box-Cox elasticity

dummy for

Gini dummy for earnings `urban-rural constant

R2 = 0.39

Sample size = 185

26

Figure 1. Share of between component and number of groups: E0 50

% share

40 30 20 10 0 0

5

10

15

20

25

Number of groups

Figure 2. Share of between component and number of groups: E1

50

% share

40 30 20 10 0 0

5

10

15

Number of groups

27

20

25

Figure 3. Share of between component and number of groups: E2

50

% share

40 30 20 10 0 0

5

10

15

20

25

Number of groups

Figure 4. Share of between component and number of groups: Gini 80 70

% share

60 50 40 30 20 10 0 0

5

10

15

Number of groups

28

20

25

Suggest Documents