2
Confidence Interval Estimation for a Difference Between Two Dependent Intraclass Correlation Coefficients With Variable Class Sizes
4
Danuta Kowalik a , Yun-Hee Choi a , G Y Zou a,b,c,d , a Department
6
b Robarts 8
of Epidemiology and Biostatistics, University of Western Ontario, London, Ontario, Canada N6A 5C1
Clinical Trials, Robarts Research Institute, University of Western Ontario, London, Ontario, Canada N6A 5K8
c Department 10
of Epidemiology and Biostatistics, School of Public Health, Southeast University, Nanjing, Jiangsu Province, PR China
d Corresponding 12
14
to: GY Zou, Robarts Clinical Trials, Robarts Research Institute, P. O. Box 5015, 100 Perth Drive, London, Ontario, Canada N6A 5K8; Tel: 1-519-663-5777 Ext 24092; fax: 1-519-661-3766. E-mail:
[email protected].
Abstract Four confidence intervals for a difference between two dependent intraclass correlation
16
coefficients (ICCs) are developed, focusing on applications to family studies. The basic idea adopted here is that confidence intervals for a difference between two parameters can
18
be obtained from confidence limits for each parameter. Among the four confidence procedures considered, the one based on the inverse hyperbolic tangent transformation for a
20
single ICC performed best. Francis Galton’s data on human stature are used to illustrate the methodology.
22
Key words: Family studies, Fisher’s z-transformation, Heritability, Interval estimation, Intraclass correlation coefficient, Sib-sib correlation.
Preprint submitted to Journal of Statistical Theory and Practice
26 March 2011
24
1 Introduction
The intraclass correlation coefficient (ICC) is an important parameter in several fields of 26
research. In measurement studies, it is used to quantify reliability, where observations may be made on a sample of several assessors (Bartko, 1966). In genetic research, the ICC is
28
commonly used to measure the degree of familial resemblance with respect to environmental or biological traits (Haggard, 1958). The ICC also plays a key role in adjusting for clustering
30
effects in the design and analysis of clinical trials that randomize intact social units (Donner and Klar, 2000). See Donner (1986) and McGraw and Wong (1996) for extensive reviews on
32
the topic. With few exceptions (Donner et al., 1984; Donner and Zou, 2002; Giraudeau et al., 2005),
34
relatively little research has appeared on the inference involving dependent ICCs. Even if such a problem does arise, researchers tend to turn the question into comparing independent
36
ICCs. For example, Naik and Helu (2007) recently analyzed Francis Galton’s familial data on human stature (Hanley, 2004) by randomly splitting the data into two parts. One part
38
was used to compute ICC among boys and the other ICC among girls. The inference on gender difference in terms of similarity was then conducted using hypothesis procedures for
40
independent ICCs. Furthermore, despite that confidence intervals are more informative than hypothesis testing,
42
it is very rare to find that inference on a difference between two ICCs is conducted with confidence intervals. A confidence interval for the difference between the sister-sister and
44
brother-brother ICCs would be more informative than a P-value resulted from hypothesis testing.
46
Ramasundarahettige et al. (2009) recently applied a general confidence interval procedure for a difference to the case of dependent ICCs in the context of reliability studies having
48
constant class sizes. Since the basic idea is to use confidence limits for single ICCs to recover variance estimates needed for a difference, we refer to the method as the MOVER, method 2
50
of variance estimates recovery (Zou and Donner, 2008; Zou, 2008). The procedure based on exact confidence limits for a single ICC from the F-distribution performed best. However,
52
the results may not hold for studies with variable class sizes in which Donner and Wells (1986) have found that the confidence interval for a single ICC based on F-distribution does
54
not have superior performance. The purpose of this article is to adopt a similar approach to situations where the single ICCs
56
are computed with variable class sizes, focusing on the case of family studies. We thus considered four procedures for a difference between two ICCs using confidence limits for single
58
ICCs obtained from 1) variance formula by Smith (1957); 2) Fisher’s z-transformation; 3) the inverse hyperbolic tangent (arctanh) transformation; and 4) an approximate F-distribution
60
(Thomas and Hultquist, 1978). Application of the MOVER resulted in four confidence interval procedures for a difference between two dependent ICCs. As in Ramasundarahettige
62
et al. (2009), we take into account dependency between two ICCs using a result from Elston (1975).
64
After setting up notations in Section 2, we summarize the four common procedures for a single ICC with variable class sizes in Section 3. Section 4 then applies the MOVER to ICCs.
66
In Section 5, we report on a simulation study evaluating the performance of the procedures. An illustration of the procures is done using Galton’s data in Section 6. The article finishes
68
with a discussion.
2 Notation and Terminology
70
Let Y i = (Yi1 ,Yi2 , . . .,Yibi ,Yi,bi +1 ,Yi,bi +2 , . . .,Yi,bi +si ) denote measurements on the ith family, i = 1, 2, . . ., k, where Yi1 ,Yi2 , . . . ,Yibi are the scores on
72
the bi brothers and Yi,bi +1 , . . . ,Yi,bi +si are the scores on the si sisters, respectively. We assume 3
the following model holds Y i ∼ N(µ i , Σi ), 74
where µ = (µb 1Tbi , µs 1Tsi ) and
[(1 − ρb)I bi + ρb J bi ]σb2 ρbs σb σs J bi ×si Σi = ρbs σb σs J Tbi ×si [(1 − ρs)I si + ρs J si ]σs2 In addition, here 1bi and 1si are the column vectors with all the bi and si elements equal to 1, 76
I bi and I si are bi × bi and si × si identity matrices, J bi and Jsi are bi × bi and si × si matrices with all the elements equal to 1 and, J bi ×si is a bi × si matrix for which all elements equal
78
1. A necessary and sufficient condition for Σi to be positive definite, for all bi and si , is that 2 < ρ ρ . Under this model, it is assumed that the two intraclass correlation coefficients ρbs b s
80
ρb and ρs , and the interclass correlation coefficient ρbs are constant among families in the population of studies, and in particular they are independent of sibling size bi + si . Our main
82
interest is in developing and comparing methods for confidence intervals for ρb − ρs . In the case of constant class size, i.e., bi = b and si = s, for i = 1, 2, . . ., k, Elston (1975)
84
derived the maximum likelihood estimators for ρb , ρs, and ρbs . In the case of variable class sizes, Donner and Koval (1980) found that analysis of variance (ANOVA) estimators for
86
single ICCs are more stable. Let Yi j , for j = 1, . . ., bi , denote observation on jth member in the ith class, with class specific mean for brothers given by Y ib = ∑ j Yi j . The ANOVA
88
estimator for ρb is given by
where
ρbb =
MSAb − MSEb , MSAb + (b0 − 1)MSEb
∑i bi (Y ib − ∑i ∑ j Yi j /N)2 MSAb = k−1 90
and ∑i ∑ j (Yi j −Y ib )2 MSEb = N −k 4
are the mean square errors between and within classes, respectively, b0 = 92
N − Σki=1 b2i /N , k−1
and N = ∑ bi is the total number of observations for brothers across families. The estimator for ρs is given analogously. Rosner (1982) suggested estimating ρbs by computing the Pear-
94
son product-moment correlation over all possible brother-sister pairs. This estimator is given by b
b +s
i ∑ j′i=bii +1 (Yi j − Yeb )(Yi j′ − Yes ) ∑ki=1 ∑ j=1
96
where
, ρbbs = q +si i 2] e ′ (Y − Y ) (Yi j − Yeb )2 ][∑ki=1 bi ∑bj′i=b [∑ki=1 si ∑bj=1 s ij i +1 ∑ bi siY ib ∑ bi siY is Yeb = i and Yes = i . ∑i bi si ∑i bi si
98
3 Confidence intervals for a single intraclass correlation coefficient
As mentioned above, confidence interval for a difference between two ICCs, ρb − ρs , relies 100
on reliability of confidence interval for single ICCs. We therefore summarize four procedures for a single ICC, ρb . The results are also applicable to ρs .
102
The most intuitive interval estimation approach is to evoke the central limit theorem in conjunction with Slutzky’s theorem, by which confidence limits are given by
104
lSA , uSA = ρbb ± zα /2
q
c ρbb ), var(
where zα /2 is the upper α /2 quantile of the standard normal distribution, and 2(1 − ρb )2 {1 + ρb(b0 − 1)}2 var(ρbb ) = N −k b20
(k − 1)(1 − ρb){1 + ρb(2b0 − 1)} + ρb2 {∑ b2i − 2N −1 ∑ bi 3 + N −2 (∑ b2i )2 } + (k − 1)2 5
#
which was first derived by Smith (1957). Confidence interval for ρb is obtained by substitut106
ing ρbb for ρb in the variance formula. We refer to this method as the simple asymptotic (SA) method.
108
It is well-known that the sampling distribution of the ICC is left-skewed. A transformation due to Fisher (1925) is commonly used to make inference for ICC with constant class size.
110
The Fisher z-transformation is given by 1 1 + (b − 1)ρbb ln , 2 1 − ρbb
which is asymptotically distributed as 1 1 + (b − 1)ρb ln N , 2 1 − ρb 112
1 o 1n 1 . + 2 k−1 N −k
In the case of variable class size, Weinber and Patel (1981) found replacing b by b0 performed well. The corresponding confidence interval for ρb is given by lz =
114
116
118
120
exp(2lF ) − 1 , exp(2lF ) + b0 − 1
uz =
exp(2uF ) − 1 exp(2uF ) + b0 − 1
where
r n 1 1 + (b − 1)ρbb 1 1 1 o ± zα /2 lF , uF = ln . + 2 1 − ρbb 2 k−1 N −k An alternative approach to Fisher’s z-transformation is to apply the inverse hyperbolic tangent (arctanh) transformation, which Fisher (1925) suggested for the Pearson correlation, to
ρbb and derive its variance formula by the delta methods. By the central limit theorem, we have
1 1 + ρbb ln ∼N 2 1 − ρbb
1 1 + ρb var(ρbb) ln , 2 1 − ρb (1 − ρb2 )2
,
where var(ρbb ) can be obtained using the Smith formula given above. The confidence interval
by arctanh transformation is given by la = where
exp(2lt ) − 1 , exp(2l) + 1
ua =
exp(2ut ) − 1 , exp(2ut ) + 1
p c ρbb ) 1 1 + ρbb zα /2 var( lt , ut = ln . ± 2 1 − ρbb 1 − ρbb2 6
122
Finally, Thomas and Hultquist (1978) proposed an approximation to the exact confidence interval for ICC, given by le =
124
F ∗ /Fu − 1 , Fu + bh − 1
ue =
F ∗ /Fl − 1 Fl + bh − 1
where F ∗ is the conventional F-statistic computed using the unweighted sum of square of class means, given by bh ∑i (Y i. − ∑i Y i. /k)2 /(k − 1) , ∑i ∑ j (Yi j −Y i. )2 /(N − k) and bh is the harmonic mean of numbers of brothers across families, Fl and Fu are α /2 and F∗ =
126
1 − α /2 quantiles of the F-distribution with degrees of freedom k − 1 and N − k.
128
4 Confidence intervals for a difference between two dependent intraclass correlation coefficients
130
We first derive confidence interval for ρbb − ρbs assuming they are independent, and then extend to the case of dependent ICCs.
132
By the central limit theorem, the 100(1 − α )% confidence limits for ρb − ρs are given by
and
134
136
L = ρbb − ρbs − zα /2
p var(ρˆ b ) + var(ρˆ s)
U = ρbb − ρbs + zα /2
p
var(ρˆ b ) + var(ρˆ s)
where again zα /2 is the upper α /2 quantile of the standard normal distribution. The performance of (L,U ) depends on how the variances are estimated. Note that var(ρb) is a function of
ρ , indicating that variance of ρb depends on the parameter itself. Substituting ρb for ρ assumes
implicitly that point estimates are independent of variance estimates, resulting in symmetric 138
confidence intervals. In the spirits of score test (Rao, 1948; Bartlett, 1953, p. 15), we proceed by estimating variances needed for L and U in their respective neighborhoods.
140
Now, given confidence limits for ρb and ρs as (lb , ub ) and (ls, us ), respectively, we can show 7
142
that the distance between L and lb − us is given by q
q hq i
zα var(ρbb ) + var(ρbs) − var(ρbb ) + var(ρbs ) ,
which is shorter than that of L and ρbb − ρbs , given by
q
zα var(ρbb ) + var(ρbs) .
Thus, it is better to estimate variances needed for L under the assumption that ρb = lb and
144
ρs = us . Similar arguments suggest variances for U be estimated at ρb = ub and ρs = ls. By again the central limit theorem, we can recover the variance estimates needed for L and
146
U as follows. Since
q c ρbb ), lb ≈ ρbb − zα /2 var(
thus
148
150
at ρb = lb . Similarly
c ρbb ) = var(
at ρb = ub . Furthermore, we have
c ρbb ) = var(
at ρs = ls , and
c ρbs) = var(
at ρs = us . Therefore
c ρbs) = var( L = ρbb − ρbs − zα /2 = ρbb − ρbs − zα /2
152
Similar steps yield
= ρbb − ρbs −
U = ρbb − ρbs +
p
(ρbb − lb )2 zα2 /2
(ub − ρbb )2 zα2 /2 (ρbs − ls )2 zα2 /2 (us − ρbs )2 zα2 /2
p r
var(ρˆ b ) + var(ρˆ s) (ρbb −lb )2 zα2 /2
+
(us −ρbs )2 zα2 /2
(ρbb − lb )2 + (us − ρbs )2 .
q (ub − ρbb )2 + (ρbs − ls )2 . 8
Note that these confidence limits are in general asymmetric, as shown in Figure 1 (using the 154
Pythagorean theorem), unless confidence limits for ρb and ρs are symmetric. The approach can be easily extended in several ways. First, noting the confidence interval
156
for −ρs is given by (−us , −ls ), we have a confidence interval for ρb + ρs given by p L = ρbb + ρbs − (ρbb − lb )2 + (ρbs − ls )2 p U = ρb + ρbs + (u − ρb )2 + (us − ρbs )2 . b b b
This result is useful for constructing confidence intervals for linear contrasts of intraclass 158
correlation coefficients (Bhandary and Fujiwara, 2008). Second, when corr(ρbb , ρbs ) = r 6= 0, recalling that cov(ρbb, ρbs ) = r
160
p
var(ρbb )var(ρbs ), we have
p L = ρbb − ρbs − (ρbb − ls )2 + (us − ρbs )2 − 2b r (ρbb − lb )(us − ρbs ) p U = ρbb − ρbs − (ub − ρbb )2 + (ρbs − ls )2 − 2b r(ub − ρbb )(ρbs − ls )
where b r is the estimate of correlation between ρbb and ρbs , given by b r=
162
p
bh (bh − 1)sh (sh − 1) ρbbs [1 + (bh − 1)ρbb ][1 + (sh − 1)ρbs]
(1)
(2)
with ρbbs denoting Rosner’s pairwise estimator of interclass correlation coefficient, and bh and
sh being the harmonic means of numbers of brothers and sisters across families, respectively. As the key step of the derivation is the recovery of variance estimates from confidence limits,
164
we refer to this approach as the MOVER, method of variance estimates recovery (Zou and Donner, 2008; Zou, 2008).
166
168
By the MOVER, we can now obtain a confidence interval for ρb − ρs as follows: 1) Obtain ρbb (lb , ub ) and ρbs (ls , us) using procedures described in Section 3; 2) Obtain b r using (2);
3) Apply the MOVER in (1) to obtain confidence limits, (L,U ) for ρb − ρs . 9
170
5 Simulation Study
Since the MOVER is an asymptotic result, meaning that the finite sample properties are 172
intractable. We therefore assessed its performance using Monte Carlo simulation. We compared the performance of the four procedures, as developed from using the four
174
procedures for single ICCs: 1) the simple asymptotic approach; 2) Fisher’s Z-transformation approach; 3) the arctanh transformation; and 4) the method based on an approximation of
176
the F-distribution. The parameters for the simulation study included the number of families k, ρb , ρs , and ρbs .
178
The total number of families was chosen to be k = 50 and 100 to reflect small and moderate sample sizes. The variable sibship sizes (s = bi + si ) were generated using the negative
180
binomial distribution truncated below one given by Pr(s) =
(m + s − 1)!Q−1 (P/Q)s , (m − 1)!m!(1 − Q−m )
s = 1, 2, . . ., Q = 1 + P.
We chose the parameter values of m = 2.84 and P = 0.93, corresponding to the mean sibship 182
size of 3.12 and the variance of 4.52, reported by Brass (1958) for the United States in 1950. In order to avoid generating unreasonably large sibship sizes we truncated the extreme right
184
tails of sibship sizes at the probability of 0.999. To classify each sibling as a male or a female, a random number generator based on the binomial distribution was used with the probability
186
0.5. Parameter values for ρb and ρs were set to as (0.1, 0.1), (0.3, 0.1), (0.3, 0.3), (0.5, 0.1), (0.5, 0.3), (0.5, 0.5), (0.7, 0.3), (0.7, 0.5), (0.7, 0.7), (0.9, 0.5), (0.9, 0.7), (0.9, 0.9). For each
188
combination of ρb and ρs, we set ρbs to range from 0.1 to min(ρb , ρs ) with an increment of 0.2 such that the variance-covariance matrix Σi to be positive definite.
190
Without loss of generality µb = µs = 0 and σb2 = σs2 = 1, where µb , µs and σb2 , σs2 are the common mean and common variance of the observations obtained on brothers and sisters,
192
respectively. For each parameter combination, observations for brothers and sisters were generated from a multivariate normal distribution. The number of runs was set to 10000 10
194
such that the empirical coverage is expected (with 95% confidence interval) to vary between 94.6% to 95.4% for a nominal level of 95%. In the evaluation process, we considered three
196
criteria: empirical coverage, balance of left and right tail errors, and interval width. Table 1 presents simulation study results for k = 50. The simple asymptotic method based
198
on Smith’s variance formula tends to result in anti-conservative coverage for large difference between two ICCs, and conservative coverage for small difference of high ICC values. In
200
addition, it also results in unbalanced tails errors for high values of ICCs. It is clear that the methods based on Fisher’s Z-transformation and the arctanh transformation gives compara-
202
ble results when values of ICCs are small to moderate (≤ 0.5), providing good empirical coverage and balanced tail errors and competitive interval width. However, over the whole
204
parameter space, the arctanh transformation provided better results in terms of coverage and the balance of tail errors. Moreover, this method tends to provide narrower intervals, com-
206
pared to the simple asymptotic method. Confidence intervals based on the approximate Fdistribution appeared to have slightly conservative coverage with the largest interval width.
208
This observation contrasts to the results by Ramasundarahettige et al. (2009) who reported that the method based on the exact limits for single ICCs performed the best. However, the
210
observation here is expected, as Donner and Wells (1986) have concluded that the approximate F-distribution based method tends to be imprecise in the case of variable class sizes.
212
Similar conclusions can be drawn for k = 100, as shown in Table 2. The problems associated with the simple asymptotic method are reduced somewhat but still overt. The method based
214
on F-distribution is also slightly conservative.
6 Example: Galton’s data
216
We now illustrate the four procedures using Galton’s data on siblings height, available at http://www.epi.mgill.ca/hanley/galton (Hanley, 2004). The data consist of 205 fam-
218
ilies (486 sons, 476 daughters) with the sibship size ranging from 1 to 15. Among the 962 children, only 934 have have numeric values, which are the basis for this analysis. 11
220
222
Based on the data we obtained ρbb = 0.385, ρbs = 0.421, ρbb − ρbs = −0.036, and ρbbs = 0.264, yielding b r = 0.061. The confidence interval estimates of ρb , ρs and ρb − ρs using the four
methods are summarized in Table 3. Based on the results there is no significant difference between the sex-specific ICCs at the α = 5% level regardless of the method used, as the
224
resulting confidence intervals for ρb − ρs all contain zero. However, even for sample size
over 200, the differences among four confidence intervals for ρb − ρs are still distinguishable. 226
Consistent with the simulation, the method based on F-distribution provides wider intervals.
7 Discussion
228
We have developed four confidence intervals for a difference between two correlated ICCs, focusing on family studies. The results can be seen as a direct extension of Ramasundara-
230
hettige et al. (2009). Previous research has largely focused on hypothesis testing of equality (Donner et al., 1984), but confidence intervals are more informative as they indicate sta-
232
tistical significance as well as the magnitude of the difference, providing basis for judging practical importance.
234
The confidence intervals are simple to calculate, as only limits for single ICCs are required. The advantage of the MOVER is that it can take into account asymmetric sampling distri-
236
bution for ρb. Another advantage of the MOVER is that it can be used when the raw data
are not available but the summary in terms of point estimates of ICCs and their respective
238
confidence limits are given. It must be emphasized that our results are valid under several assumptions. First, the dis-
240
tribution of class sizes was obtained from families typically seen in United States in 1950. In the cases of greater imbalance, the conclusions here may not be applicable. One way to
242
reduce imbalance may be to exclude families with a single child in the analysis since Swiger et al. (1991) have suggested this can result in better estimates of ICC. The is also in accor-
244
dance with the recommendation by Eliasziw and Donner (1991) that interclass correlation 12
coefficient be estimated by placing a restriction on the family size. Such a strategy is most 246
useful in the case of large sample size. Second, the validity of the MOVER depends on that of confidence limits for single ICCs. The confidence intervals for single ICCs relied on the
248
assumption that the data are at least approximately normally distributed and the number of families is not too small. In situations when these assumptions may not hold, other proce-
250
dures should be sought. One possible solution is to first transform the data using a power transformation and then obtain the desired confidence interval using the proposed method.
252
However, due to the transformation the confidence intervals on the transformed scale may not be directly interpretable. Another possibility is to use the bootstrap. This procedure has
254
been widely recommended for non-normal data. For example, Ukoumunne et al. (2003) implemented a non-parametric bootstrap confidence interval for ICC with constant class size
256
in the context of cluster randomization trials. They showed that the application of bootstrapt method to the Fisher’s z-transformation of ICC performed well even with small sample
258
sizes. Evaluation of the MOVER and the bootstrap for the difference of two correlated ICCs of small values arising from small numbers of large clusters, however, is left for further
260
research.
Acknowledgments
262
References Bartko, J. J., 1966. The intraclass correlation coefficient as a measure of reliability. Psycho-
264
logical Reports 19, 3–11. Bartlett, M. S., 1953. Approximate confidence intervals. Biometrika 40, 12–19.
266
Bhandary, M., Fujiwara, K., 2008. Confidence interval estimation for a linear contrast in intraclass correlation coefficients under unequal family sizes for several populations. Journal
268
of Satatistical Computation and Simulation 78, 609–621. Brass, W., 1958. Models of birth distribution in human populations. Bulletin of International
270
Statistical Institute 36, 165–179. 13
Donner, A., 1986. A review of inference procedures for the intraclass correlation coefficient 272
in the one-way random effects model. International Statistical Review 54, 67–82. Donner, A., Klar, N., 2000. Design and Analysis of Cluster Randomization Trials in Health
274
Research. Arnold, New York. Donner, A., Koval, J. J., 1980. The estimation of intraclass correlation in the analysis of
276
family data. Biometrics 36, 19–25. Donner, A., Koval, J. J., Bull, S., 1984. Testing the effect of sex differences on sib-sib corre-
278
lations. Biometrics 40, 349–356. Donner, A., Wells, G., 1986. A comparison of confidence interval methods for the intraclass
280
correlation coefficient. Biometrics 42, 401–412. Donner, A., Zou, G., 2002. Testing the equality of dependent intraclass correlation coeffi-
282
cients. Statistician 51, 367–379. Eliasziw, M., Donner, A., 1991. A generalized non-iterative approach to the analysis of fam-
284
ily data. Annals of Human Genetics 55, 77–90. Elston, R. C., 1975. On the correlation between correlations. Biometrika 62, 133–140.
286
Fisher, R. A., 1925. Statistical Methods for Research Workers. Oliver and Boyd, Edinburgh. Giraudeau, B., Porcher, R., Mary, J., 2005. Power calculation for the likelihood ratio-test
288
when comparing two dependent intraclass correlation coefficients. Computer Methods and Programs in Biomedicine 77, 165–173.
290
Haggard, E. A., 1958. Intraclass Correlation and the Analysis of Variance. Dryden Press, New York.
292
Hanley, J. A., 2004. Transmuting women into men: Galton’s family data on human stature. American Statistician 58, 237–243.
294
McGraw, K. O., Wong, S., 1996. Forming inference about some intraclass correlation coefficients. Psychological Methods 1, 30–46.
296
Naik, D. N., Helu, A., 2007. On testing equality of intraclass correlations under unequal family sizes. Computational Statistics and Data Analysis 51, 6498–6510.
298
Ramasundarahettige, C. F., Donner, A., Zou, G. Y., 2009. Confidence interval construction for a difference between two dependent intraclass correlation coefficients. Statistics in
14
300
Medicine 28, 1041–1053. Rao, C. R., 1948. Large sample tests of statistical hypotheses concerning several parameters
302
with applications to problems of estimation. Mathematical Proceedings of the Cambridge Philosophical Society 44, 50–57.
304
Rosner, B., 1982. On the estimation and testing of intraclass correlations: The general case of multiple replicates for each variable. American Journal of Epidemiology 116, 722–730.
306
Smith, C. A. B., 1957. On the estimation of intraclass correlation. Annals of Human Genetics 21, 363–373.
308
Swiger, L. A., Harvey, W. R., Everson, D. O., Gregory, K. E., 1991. The variance of intraclass correlation involving groups with one observation. Biometrics 20, 818–826.
310
Thomas, J. D., Hultquist, R. A., 1978. Interval estimation for the unbalanced case of the one-way random effects model. Annals of Statistics 6, 582–587.
312
Ukoumunne, O. C., Davison, A. C., Gulliford, M. C., Chinn, S., 2003. Non-parametric bootstrap confidence intervals for the intraclass correlation coefficient. Statistics in Medicine
314
22, 3805–3821. Weinber, R., Patel, Y. C., 1981. Simulated intraclass correlation coefficients and their z-
316
transformations. Journal of Satatistical Computation and Simulation 13, 13–26. Zou, G. Y., 2008. On the estimation of additive interaction by use of the four-by-two tables
318
and beyond. American Journal of Epidemiology 168, 212–224. Zou, G. Y., Donner, A., 2008. Construction of confidence limits about effect measures: A
320
general approach. Statistics in Medicine 27, 1693–1702.
15
us ρb − ρs √Lower for 2 2
Upper for ρb + ρs √
(ρbb −lb ) +(us −ρbs )
(ub −ρbb )2 +(us −ρbb )2
ρbs ρbb
lb
Lower for ρb + ρs
√
ub
√
(ρbb −lb )2 +(ρbs −ls )2
Upper for ρb − ρs
(ub −ρbb )2 +(ρbs −ls )2
ls
Figure 1. Geometric illustration of margins of errors as obtained using the Method Of Variance Estimates Recovery (MOVER) for ρb − ρs and ρb + ρs , which are identical to that by the Pythagorean theorem. Note that a confidence interval is given by point estimate − lower margin of error, point estimate + upper margin of error
16
Table 1 Performance (based on 10000 runs) of four procedures for constructing two-sided 95% confidence interval for a difference between two dependent intraclass correlation coefficients for k = 50. ρb ρs ρbs 0.1 0.1 0.0 0.1
Simple asymptotic CV(ML, MR)WD 94.19(2.87,2.94)0.91 93.96(2.97,3.07)0.90
Fisher’s z CV(ML, MR)WD 94.63(2.67,2.70)0.87 94.32(2.82,2.86)0.87
Arctanh CV(ML, MR)WD 95.16(2.41,2.43)0.88 95.09(2.41,2.50)0.87
F-distribution CV(ML, MR)WD 96.27(1.78,1.95)1.17 96.16(1.94,1.90)1.17
0.3 0.1 0.0 0.1 0.3 0.0 0.1 0.3
94.25(2.71,3.04)0.87 94.56(2.59,2.85)0.87 94.50(2.75,2.75)0.84 94.10(2.98,2.92)0.84 94.86(2.58,2.56)0.83
94.34(2.49,3.17)0.84 94.81(2.30,2.89)0.84 94.43(2.77,2.80)0.81 94.21(2.97,2.82)0.81 94.90(2.50,2.60)0.80
94.98(2.05,2.97)0.85 95.43(1.90,2.67)0.85 95.15(2.37,2.48)0.82 94.79(2.67,2.54)0.82 95.45(2.25,2.30)0.81
96.13(1.78,2.09)1.10 96.41(1.54,2.05)1.10 96.30(1.81,1.89)1.02 96.25(1.88,1.87)1.02 96.35(1.76,1.89)1.01
0.5 0.1 0.0 0.1 0.3 0.0 0.1 0.3 0.5 0.0 0.1 0.3 0.5
94.82(2.53,2.65)0.81 94.32(2.74,2.94)0.81 94.45(2.50,3.05)0.78 94.68(2.62,2.70)0.78 95.65(2.03,2.32)0.77 94.92(2.67,2.41)0.71 94.93(2.33,2.74)0.71 95.36(2.46,2.18)0.70 96.38(1.87,1.75)0.69
94.68(2.28,3.04)0.78 94.29(2.43,3.28)0.78 94.06(2.62,3.32)0.75 94.36(2.58,3.06)0.75 95.46(1.96,2.58)0.74 93.69(3.29,3.02)0.69 93.92(2.94,3.14)0.69 94.41(2.98,2.61)0.68 95.43(2.33,2.24)0.67
95.41(1.76,2.83)0.80 94.78(2.12,3.10)0.79 94.70(2.23,3.07)0.77 95.08(2.27,2.65)0.77 96.06(1.72,2.22)0.76 94.69(2.78,2.53)0.71 94.82(2.45,2.73)0.71 95.11(2.64,2.25)0.70 96.16(1.94,1.90)0.69
96.10(1.61,2.29)1.01 95.73(1.91,2.36)1.01 95.65(1.81,2.54)0.92 96.18(1.85,1.97)0.92 96.91(1.36,1.73)0.92 95.92(2.18,1.90)0.82 95.95(1.87,2.18)0.82 96.39(1.96,1.65)0.81 96.75(1.70,1.55)0.80
0.7 0.3 0.0 0.1 0.3 0.5 0.0 0.1 0.3 0.5 0.7 0.0 0.1 0.3 0.5 0.7
93.98(2.66,3.36)0.69 93.94(2.39,3.67)0.69 94.84(2.08,3.08)0.69 94.64(2.19,3.17)0.62 94.79(1.96,3.25)0.61 95.32(1.89,2.79)0.61 95.53(1.51,2.96)0.60 96.04(1.88,2.08)0.50 95.95(2.19,1.86)0.50 96.28(1.77,1.95)0.50 97.26(1.53,1.21)0.49 97.90(1.09,1.01)0.48
93.59(2.72,3.69)0.67 93.47(2.45,4.08)0.67 94.77(2.14,3.09)0.67 93.32(2.98,3.70)0.60 93.52(2.80,3.68)0.60 94.02(2.61,3.37)0.59 94.52(2.05,3.43)0.59 93.39(3.22,3.39)0.49 93.46(3.49,3.05)0.49 93.64(3.04,3.32)0.49 94.83(2.74,2.43)0.48 96.01(2.08,1.91)0.48
94.49(2.47,3.04)0.69 94.56(2.16,3.28)0.68 95.42(1.90,2.68)0.68 94.46(2.58,2.96)0.62 94.50(2.36,3.14)0.62 95.10(2.25,2.65)0.62 95.48(1.78,2.74)0.61 94.87(2.46,2.67)0.52 94.74(2.78,2.48)0.52 95.09(2.33,2.58)0.52 96.23(1.97,1.80)0.51 96.88(1.60,1.52)0.50
95.19(2.03,2.78)0.81 95.23(1.90,2.87)0.81 95.74(1.73,2.53)0.81 95.35(2.02,2.63)0.70 95.43(1.84,2.73)0.69 95.67(1.93,2.40)0.69 96.06(1.46,2.48)0.68 95.35(2.25,2.40)0.55 95.48(2.41,2.11)0.55 95.72(2.01,2.27)0.55 96.47(1.82,1.71)0.54 97.17(1.46,1.37)0.54
0.9 0.5 0.0 0.1 0.3 0.5 0.7 0.0 0.1 0.3 0.5 0.7 0.9 0.0 0.1 0.3 0.5 0.7 0.9
93.19(1.82,4.99)0.52 93.29(1.80,4.91)0.52 93.49(1.70,4.81)0.52 93.82(1.60,4.58)0.51 94.78(0.84,4.38)0.38 94.05(1.06,4.89)0.38 94.65(0.82,4.53)0.38 94.40(0.98,4.62)0.38 95.30(0.61,4.09)0.37 97.80(0.96,1.24)0.20 98.01(0.96,1.03)0.20 97.82(1.10,1.08)0.20 98.35(0.77,0.88)0.19 98.53(0.77,0.70)0.19 99.19(0.37,0.44)0.19
93.56(2.71,3.73)0.50 93.74(2.68,3.58)0.50 93.86(2.66,3.48)0.50 94.24(2.54,3.22)0.50 93.86(2.67,3.47)0.37 93.10(3.04,3.86)0.37 93.90(2.59,3.51)0.37 93.51(2.94,3.55)0.37 94.98(2.05,2.97)0.36 92.94(3.37,3.69)0.20 93.18(3.44,3.38)0.20 92.82(3.40,3.78)0.20 94.11(2.84,3.05)0.20 94.88(2.63,2.49)0.19 96.29(1.63,2.08)0.19
94.55(2.49,2.96)0.52 94.75(2.44,2.81)0.52 94.83(2.40,2.77)0.52 95.11(2.32,2.57)0.52 95.12(2.13,2.75)0.39 94.75(2.52,2.73)0.39 95.14(2.14,2.72)0.39 94.93(2.35,2.72)0.39 96.21(1.60,2.19)0.38 94.90(2.39,2.71)0.22 95.27(2.43,2.30)0.22 94.87(2.51,2.62)0.22 95.77(1.91,2.32)0.21 96.25(1.87,1.88)0.21 97.34(1.15,1.51)0.21
94.67(2.39,2.94)0.58 94.57(2.30,3.13)0.58 94.99(2.18,2.83)0.58 95.41(1.94,2.65)0.58 95.48(1.76,2.76)0.41 95.06(2.14,2.80)0.41 95.24(1.89,2.87)0.41 95.49(1.94,2.57)0.41 96.25(1.36,2.39)0.40 95.38(2.02,2.60)0.21 95.37(2.38,2.25)0.21 95.50(2.16,2.34)0.21 96.00(1.80,2.20)0.21 96.48(1.90,1.62)0.20 97.59(1.05,1.36)0.20
†CV: coverage %; ML: % of intervals missing the parameter value from the left; MR: % of intervals missing the parameter value from the right; WD: interval width.
17
Table 2 Performance (based on 10000 runs) of four procedures for constructing two-sided 95% confidence interval for a difference between two dependent intraclass correlation coefficients for k = 100. ρb ρs ρbs 0.1 0.1 0.0 0.1
Simple asymptotic CV(ML, MR)WD 94.65(2.77,2.58)0.63 94.74(2.43,2.83)0.63
Fisher’s z CV(ML, MR)WD 94.85(2.70,2.45)0.62 94.95(2.31,2.74)0.62
Arctanh CV(ML, MR)WD 95.13(2.56,2.31)0.62 95.25(2.19,2.56)0.62
F-distribution CV(ML, MR)WD 96.41(1.91,1.68)0.82 96.43(1.61,1.96)0.82
0.3 0.1 0.0 0.1 0.0 0.3 0.1 0.3
94.70(2.67,2.63)0.61 94.98(2.52,2.50)0.61 94.93(2.57,2.50)0.59 94.70(2.72,2.58)0.59 95.55(2.43,2.02)0.58
94.73(2.49,2.78)0.60 94.93(2.41,2.66)0.60 94.73(2.70,2.57)0.57 94.41(2.88,2.71)0.57 95.30(2.55,2.15)0.57
95.09(2.24,2.67)0.60 95.40(2.06,2.54)0.60 95.19(2.40,2.41)0.58 94.99(2.54,2.47)0.58 95.72(2.35,1.93)0.58
96.26(1.70,2.04)0.76 96.18(1.77,2.05)0.76 96.26(1.87,1.87)0.70 96.08(2.00,1.92)0.70 96.51(1.85,1.64)0.70
0.5 0.1 0.0 0.1 0.3 0.0 0.1 0.3 0.5 0.0 0.1 0.3 0.5
94.65(2.76,2.59)0.57 94.61(2.68,2.71)0.57 94.65(2.59,2.76)0.54 95.19(2.28,2.53)0.54 95.45(2.12,2.43)0.54 95.13(2.35,2.52)0.50 94.91(2.67,2.42)0.50 95.55(2.13,2.32)0.49 96.50(1.64,1.86)0.49
94.48(2.56,2.96)0.55 94.44(2.41,3.15)0.55 94.15(2.67,3.18)0.53 94.72(2.45,2.83)0.53 94.99(2.23,2.78)0.52 94.13(2.88,2.99)0.48 93.79(3.23,2.98)0.48 94.50(2.60,2.90)0.47 95.62(2.03,2.35)0.47
94.96(2.29,2.75)0.56 94.91(2.16,2.93)0.56 94.82(2.43,2.75)0.54 95.28(2.12,2.60)0.54 95.57(1.97,2.46)0.54 95.10(2.38,2.52)0.50 94.89(2.65,2.46)0.50 95.44(2.18,2.38)0.49 96.43(1.68,1.89)0.49
95.87(1.80,2.33)0.70 95.55(2.01,2.44)0.70 95.92(1.91,2.17)0.63 96.12(1.75,2.13)0.63 96.32(1.60,2.08)0.63 96.20(1.86,1.94)0.55 95.90(2.12,1.98)0.55 96.28(1.91,1.81)0.55 96.90(1.44,1.66)0.54
0.7 0.3 0.0 0.1 0.3 0.5 0.0 0.1 0.3 0.5 0.7 0.0 0.1 0.3 0.5 0.7
95.23(2.15,2.62)0.48 94.81(2.06,3.13)0.48 94.71(2.33,2.96)0.48 95.02(2.17,2.81)0.43 95.25(1.98,2.77)0.43 95.35(1.90,2.75)0.43 96.07(1.64,2.29)0.42 95.71(2.06,2.23)0.35 95.13(2.45,2.42)0.35 95.94(2.05,2.01)0.35 96.62(1.69,1.69)0.34 97.45(1.32,1.23)0.34
94.68(2.36,2.96)0.47 94.34(2.25,3.41)0.47 94.13(2.58,3.29)0.46 93.93(2.76,3.31)0.41 93.89(2.81,3.30)0.41 94.05(2.74,3.21)0.41 95.13(2.22,2.65)0.41 93.41(3.10,3.49)0.34 92.94(3.57,3.49)0.34 94.04(2.91,3.05)0.33 94.77(2.76,2.47)0.33 95.79(2.17,2.04)0.32
95.34(2.15,2.51)0.48 95.07(2.04,2.89)0.48 94.87(2.32,2.81)0.48 94.98(2.41,2.61)0.43 95.11(2.34,2.55)0.43 95.20(2.18,2.62)0.43 96.07(1.85,2.08)0.42 94.97(2.43,2.60)0.36 94.40(2.82,2.78)0.36 95.41(2.32,2.27)0.35 96.12(2.00,1.88)0.35 96.98(1.57,1.45)0.34
95.75(1.99,2.26)0.56 95.52(1.83,2.65)0.56 95.55(2.07,2.38)0.55 95.41(1.95,2.64)0.47 95.89(1.85,2.26)0.47 95.66(1.99,2.35)0.47 96.41(1.64,1.95)0.46 95.63(2.11,2.26)0.37 95.05(2.51,2.44)0.37 95.93(2.02,2.05)0.36 96.20(2.06,1.74)0.36 97.07(1.45,1.48)0.36
0.9 0.5 0.0 0.1 0.3 0.5 0.7 0.0 0.1 0.3 0.5 0.7 0.9 0.0 0.1 0.3 0.5 0.7 0.9
94.08(1.83,4.09)0.36 94.27(1.83,3.90)0.36 94.29(1.71,4.00)0.36 94.60(1.58,3.82)0.36 94.64(1.50,3.86)0.26 94.35(1.23,4.42)0.26 94.64(1.24,4.12)0.26 95.41(0.99,3.60)0.26 95.83(0.88,3.29)0.26 96.50(1.79,1.71)0.13 96.55(1.80,1.65)0.14 96.87(1.50,1.63)0.13 97.06(1.46,1.48)0.13 97.52(1.13,1.35)0.13 98.49(0.81,0.70)0.13
93.65(3.05,3.30)0.35 94.17(2.72,3.11)0.35 93.98(2.69,3.33)0.35 94.35(2.55,3.10)0.35 93.58(2.93,3.49)0.25 93.19(2.93,3.88)0.25 93.74(2.78,3.48)0.25 94.26(2.55,3.19)0.25 94.75(2.38,2.87)0.25 93.08(3.53,3.39)0.13 92.98(3.35,3.67)0.13 93.60(3.28,3.12)0.13 93.97(3.04,2.99)0.13 94.73(2.54,2.73)0.13 96.39(1.86,1.75)0.12
94.74(2.70,2.56)0.36 95.06(2.37,2.57)0.36 94.95(2.42,2.63)0.36 95.46(2.14,2.40)0.36 95.15(2.25,2.60)0.27 94.82(2.31,2.87)0.27 95.24(2.23,2.53)0.27 95.90(1.81,2.29)0.26 95.86(1.96,2.18)0.26 95.03(2.55,2.42)0.14 94.97(2.53,2.50)0.14 95.42(2.23,2.35)0.14 95.72(2.11,2.17)0.14 96.34(1.74,1.92)0.14 97.59(1.31,1.10)0.13
95.06(2.31,2.63)0.40 95.06(2.22,2.72)0.40 95.25(1.99,2.76)0.39 95.19(2.10,2.71)0.39 95.05(2.29,2.66)0.27 95.10(1.98,2.92)0.27 95.44(1.93,2.63)0.27 95.88(1.72,2.40)0.27 96.04(1.73,2.23)0.27 95.16(2.53,2.31)0.13 95.14(2.39,2.47)0.14 95.56(2.23,2.21)0.13 95.87(2.04,2.09)0.13 96.67(1.66,1.67)0.13 97.75(1.21,1.04)0.13
†CV: coverage %; ML: % of intervals missing the parameter value from the left; MR: % of intervals missing the parameter value from the right; WD: interval width.
18
Table 3 Two sided 95% confidence intervals for ρb − ρs using four different methods of confidence interval estimation for sex-specific ICCs Method ρbb = 0.385 ρbs = 0.421 ρbb − ρbs = −0.036 Simple asymptotic Fisher’s z arctanh F-distribution
ρbs = 0.264,b r = 0.061 (0.282, 0.488) (0.315, 0.527) (-0.179, 0.107) (0.284, 0.481) (0.317, 0.517) ( -0.171, 0.101) (0.278, 0.483) (0.310, 0.521) ( -0.178, 0.107) (0.274, 0.497) (0.301, 0.533) ( -0.189, 0.123)
Note: Mean (SD) for sons: 69.23(2.63) inch, N = 481. Daughters: 64.10(2.4) inch, N = 453
19