Regression-based reference limits and their ... - Semantic Scholar

0 downloads 0 Views 333KB Size Report
Though partitioning seems to be a simple principle, it includes certain prob- lems. .... In a simple linear regression situation with only one independent variable (p.
Clinical Chemistry 44:2 327–335 (1998)

Laboratory Management

Regression-based reference limits and their reliability: example on hemoglobin during the first year of life Arja Virtanen,1* Veli Kairisto,1,2 Kerttu Irjala,1 Allan Rajama¨ki,1 and Esa Uusipaikka3

parametric or nonparametric methods from an univariate distribution [1, 2]. Many biochemical quantities are dependent on sex, age, and other known characteristics. The instruction that the IFCC and NCCLS documents give for such situations is to divide the reference values into homogeneous subgroups [1, 2]. Though partitioning seems to be a simple principle, it includes certain problems. How does one define the appropriate cutoff points between the subgroups? Are the resulting subgroups large enough for the calculation of reliable reference limits? Regression analysis has been used by several investigators for the estimation of age-dependent reference limits when the reference groups are so small that the resulting subsets are not large enough for the calculation of valid reference limits. For example, during early childhood the physiological changes occur over short time intervals, and it may be practically impossible to get a sufficient number of reference values to establish reference limits for each narrow age group. Harris and Boyd [3] considered statistical criteria for partitioning data sets in a suitable manner to obtain separate reference limits. However, even after partitioning there may still exist dependency between analyte and, for example, age in some subgroups. In those situations the use of regression analysis has been suggested for the calculation of percentiles or prediction intervals to be considered as reference limits. However, the theoretical basis of reference intervals and prediction intervals is different. If reference limits are required, they must be estimated, i.e., parameters of the reference distribution are estimated. Conversely, prediction intervals are calculated for random variables, not parameters. Irjala et al. [4] estimated reference limits for IgA, IgG, and IgM in serum in children ages 6 months to 14 years with first- or second-order regression models. If the variance of residuals was age-dependent, then they calculated age-specific standard deviations, which were used when age-related reference intervals were deter-

Calculation of reference limits by regression analysis makes it unnecessary to partition the reference data into subgroups, and age-dependent limits can be estimated within as narrow age intervals as necessary. However, the reliability of the regression-based reference limits has not been considered before. To get valid regressionbased confidence intervals (CIs) for reference limits, one must evaluate the convolution of two distributions. In this study, age-dependent reference limits with corresponding CIs were produced for blood hemoglobin concentrations over the age interval from newborns to 12 months. We describe how the variance associated with the reference limits can be estimated, and present a Table from which appropriate values can be chosen for the calculation of regression-based reference limits and exact CIs. Also, an equation for the calculation of approximate CIs is given. The data were modeled by linear regression in several cumulative age groups to find the transition zone where the slope changed. After defining this cutoff point, piecewise linear regression was applied. Reference limits and their CIs calculated by conventional and piecewise linear regression methods were almost the same in older age groups but differed significantly during the period of most rapid age-dependent changes, i.e., during the 2 months after birth. The concept of reference values and their determination is based on recommendations of the IFCC and the NCCLS. These recommendations are restricted to situations in which the limits are estimated as specified fractiles by

1 Central Laboratory, University Central Hospital of Turku, Kiinamyllynk 4 – 8, FIN-20520 Turku, Finland. 2 Departments of Clinical Chemistry and 3 Statistics, University of Turku, FIN-20500 Turku, Finland. *Author for correspondence. Fax 358-2-2613920; e-mail arja.virtanen@ utu.fi. Received June 16, 1997; revision accepted October 8, 1997.

327

328

Virtanen et al.: Reference limits

mined. If there was no age dependency in variance of residuals, conventional regression analysis was applied. In the study of Anderson et al. [5] a weighted regression analysis was applied for the determination of age-specific reference intervals for serum prostate-specific antigen (PSA) because the variability of serum PSA values increased with age.4 Burritt et al. [6] defined age- and sex-specific reference intervals for 19 biologic variables in serum samples from children ages 1 to 22 years. They used polynomial regression for age-dependent analytes and the 2.5% and 97.5% percentiles constituted the reference interval. In the Table included in their publication they show reference intervals for several age subgroups. Estimated limits at the center of each age interval were considered as age-group-specific reference limits. They also used a slight modification in older children to have a smooth transition to adult concentrations. Vicente et al. [7] established reference intervals for serum ferritin depending on age and sex. Data were analyzed within each gender separately and dependency on age was first examined by plotting ferritin values vs age. After visual inspection of data, subgrouping according to age was carried out, regression analysis performed as necessary, and 95% prediction intervals were used as reference intervals. Gallo et al. [8] presented a statistical method in which variables that affected reference intervals were considered. In this method variables such as sex, age, weight, and alcohol consumption with possible effect on serum urea were included in a regression model. The effects of age and sex and their interaction on cardiac enzymes was illustrated by regression analysis in the study of Kairisto et al. [9]. The 95% age-specific prediction intervals were calculated and prediction limits were considered as reference limits. However, none of the above articles discussed the reliability of the derived age-specific reference limits. Piecewise regression can be used after plotting the data, for example, on age to model it by linear regression. Gonchoroff et al. [10] used a piecewise linear regression model to estimate the reference interval for alkaline phosphatase in different age groups. Piecewise regression can also be applied in different cumulative age groups. From the different cumulative groups the best-fitting curves can be selected and the point(s) thereby found where the values begin to change to another direction. Especially during the first months of life, many laboratory analytes—including hemoglobin (Hgb) and red blood cell indices— change rapidly. Fig. 1 shows this clearly for Hgb. It is obviously difficult to find any one regression model for the whole age interval. Nonlinear regression (i.e., nonlinear in the parameters) would be required. A simpler method would probably be more useful for examining the various age dependencies of

4 Nonstandard abbreviations: PSA, prostate-specific antigen; Hgb, hemoglobin; and CI, confidence interval.

Fig. 1. Reference limits (2.5% and 97.5%) and 95% CIs of Hgb concentration (g/L) over the age interval 0 –12 months.

different analytes. The visual inspection of plots to judge the fit of data to the derived limits is an important step. In this paper we use piecewise regression analysis to calculate 95% age-specific reference intervals for Hgb in children ages 0 –12 months. Hgb concentrations are known to decrease quickly during the first months of life, after which the concentrations start to increase again [11]. We decided to use the first year of life to describe the use of linear piecewise regression in calculating age-specific reference limits. The conventional reference limits are point estimates from a reference sample group of the true limits, which could be calculated if data were available from the whole reference population. Hence the imprecision of the sample estimates must be considered by determining the confidence intervals (CIs). In this paper we determine the variance associated with reference limits and develop a Table from which the necessary values can be selected for the calculation of CIs of reference limits produced by the regression method. An equation from which approximate intervals can be calculated is also presented.

Subjects and Methods subjects Data were collected during a 2-year period from a total of 2347 hospitalized children ages 0 –1 year. The reference subjects were retrospectively selected according to their diagnoses in the hospital discharge diagnoses database. The list of exclusion diagnoses was based on our previous study [12]. After exclusion steps the remaining reference sample group consisted of 310 children ages 0 –1 year (mean 5.34 months). The range of Hgb values in these reference subjects was 86 –250 (mean value 128 g/L). The study protocol was officially accepted at the University Hospital of Turku and was in accordance with the Helsinki Declaration of 1975, as revised in 1983.

329

Clinical Chemistry 44, No. 2, 1998

analytical methods Blood specimens were obtained either by skin puncture or venipuncture as part of routine examination of the children. The blood was collected into microtubes containing EDTA as an anticoagulant. Coulter Counter S-Series (S Plus VI and T-880, Coulter Electronic) or Technicon H6000 (Technicon Instruments Corp.) analyzers were used for the analysis. The Coulter Counter S Plus VI was used as a master instrument and other analyzers were calibrated against it to produce the equivalent results. For internal quality control, stable control specimens and retained patient specimens were used.

statistical methods calculation of reference limits and cis In regression analysis, independent variables (e.g., age) are used to predict a dependent variable (analyte in question). A regression equation with one independent variable is [13] Yi 5 b0 1 b1x i 1 ei , i 5 1,2, . . . , n

(1)

In a regression model, each observation (Y) is assumed to come from a normal distribution centered at the concentration (b01b1xi) implied by the model with the same variance (s2). The regression-based upper reference limit for a specified age point x0 is a quantile: Q 0 5 b 0 1 b 1 x 0 1 Z as

(2)

where Za is a fractile from standard normal distribution, usually 61.96. Even though the mean square error (S2) is an unbiased estimator of s2 for the regression model, the square root S of S2 is not an unbiased estimator of s. Because (n 2 p)S2/s2 is distributed as x2 with n 2 p degrees of freedom—where n is sample size and p is number of parameters—the following is obtained: E(S) 5 s/an 2 p,

(3)

where—in this situation—an2p is approximately =(n 2 p)/(n 2 p 2 0.5) [14]. The exact value can be calculated by using gamma function, but here the approximation is used. Thus the estimator for the Eq. 2 is: ˆ 0 5 bˆ 0 1 bˆ 1x 0 1 Zas˜ Q

(4)

5 Yˆ0 1 Zas˜ where Yˆ0 is the fitted value at age point x0 and s˜ 5 an2pS is an unbiased estimator of s. The variance of the estimator (Eq. 4) is: ˆ 0) 5 Var(bˆ 0 1 bˆ 1x0) 1 Var(Zas˜) Var(Q

(5)

The variance of the first part of the Eq. 5 is: # 2/S (x 2 X) # 2] Var(bˆ 0 1 bˆ 1x0) 5 s2[1/n 1 (x0 2 X) i i (6) 5 s2(v)

where x0 is the age point at which reference limits are # is the mean value of age and v 5 [1/n 1 estimated and X 2 # # 2]. The variance of s˜ in the latter part of (x02 X) /Si(xi2 X) Eq. 5 is: Var(s˜) 5 an 2 p2Var(S)

(7)

5 an 2 p2{E(S2) 2 [E(S)]2} 5 s2(an 2 p2 2 1) In a simple linear regression situation with only one independent variable (p 5 2, i.e., two regression parameters b0 and b1), the variance associated with the reference limit is ˆ 0) 5 s2[v 1 Z2a(an 2 p2 2 1)] Var(Q

(8)

where Z2a is square of the fractile of the normal distribution. The square root of above equation is the standard error, which is needed when CIs are calculated. When appropriate values (q) from Table 1 are chosen, exact CIs for regression-based reference limits can be defined: CI 5 (bˆ 0 1 bˆ 1x0 1 Zas˜) 6 qS

(9)

5 (bˆ 0 1 bˆ 1x0 1 Zaan 2 pS) 6 qS This result corresponds to the conventional reference distribution with bˆ 0 1 bˆ 1x0 as a substitute for the mean. The regression-based limits, likewise, correspond to the conventional reference limits, and intervals around upper and lower limits correspond to the conventional CIs for reference limits. The Appendix contains the formula from which appropriate values of q can be calculated. An approximate CI that is based on the asymptotic ˆ 0 is: normal distribution of Q CI 5 (bˆ 0 1 bˆ 1x0 1 Zaan 2 pS) 6 Za9an 2 p

(10)

[v 1 Z2a(an 2 p2 2 1)]1/2S where Za9 is a fractile from a normal distribution corresponding to the appropriate confidence level. In the Appendix an example and code of a SAS program to calculate reference limits and CIs in piecewise linear regression are given. Because our main goal was to evaluate reference limits and their CIs, the percentile intervals calculated here are suitable. For other purposes tolerance intervals and prediction intervals may be used [15].

determination of the point of change Because of the known change in age dependency of Hgb during the first months of life, six cumulated age groups were formed (0 –1, 0 –2, 0 –3, 0 – 4, 0 –5, and 0 – 6 months). The age interval, i.e., the used step, was determined by calculating the optimal window width according to formula [16]: W 5 0.9An( 2 1/5), where

330

Virtanen et al.: Reference limits

Table 1. Values needed when exact CIs are calculated. v DF

0.01

0.02

0.04

0.06

0.08

0.1

0.15

0.2

0.25

0.3

0.4

0.5

5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 25 30 35 40 45 50 100

2.050 1.728 1.509 1.350 1.229 1.133 1.055 0.991 0.937 0.890 0.850 0.815 0.785 0.757 0.733 0.711 0.627 0.569 0.527 0.494 0.468 0.447 0.340

2.065 1.744 1.526 1.368 1.248 1.153 1.076 1.013 0.959 0.914 0.875 0.841 0.811 0.784 0.761 0.739 0.658 0.604 0.564 0.534 0.509 0.489 0.394

2.095 1.775 1.559 1.403 1.285 1.191 1.116 1.054 1.003 0.959 0.922 0.889 0.861 0.836 0.814 0.794 0.718 0.668 0.632 0.605 0.583 0.565 0.483

2.124 1.806 1.592 1.437 1.320 1.228 1.155 1.095 1.045 1.003 0.967 0.936 0.909 0.885 0.864 0.845 0.774 0.727 0.694 0.668 0.648 0.632 0.559

2.152 1.836 1.624 1.470 1.355 1.264 1.192 1.134 1.085 1.045 1.010 0.980 0.954 0.932 0.912 0.894 0.826 0.782 0.750 0.727 0.708 0.693 0.625

2.180 1.866 1.654 1.502 1.388 1.299 1.229 1.172 1.125 1.085 1.052 1.023 0.998 0.977 0.957 0.940 0.875 0.833 0.803 0.781 0.763 0.749 0.685

2.247 1.937 1.728 1.580 1.469 1.384 1.317 1.263 1.219 1.182 1.151 1.125 1.102 1.082 1.064 1.048 0.988 0.949 0.922 0.902 0.886 0.873 0.816

2.312 2.004 1.799 1.653 1.546 1.464 1.401 1.350 1.308 1.273 1.244 1.219 1.197 1.178 1.161 1.146 1.090 1.053 1.028 1.008 0.994 0.982 0.929

2.375 2.069 1.867 1.724 1.621 1.542 1.481 1.432 1.392 1.359 1.331 1.306 1.285 1.267 1.251 1.236 1.183 1.148 1.123 1.105 1.091 1.080 1.030

2.435 2.132 1.933 1.794 1.693 1.617 1.558 1.510 1.472 1.439 1.412 1.389 1.368 1.351 1.335 1.321 1.269 1.235 1.211 1.194 1.180 1.169 1.121

2.551 2.254 2.060 1.927 1.831 1.759 1.702 1.657 1.620 1.589 1.563 1.541 1.521 1.504 1.489 1.476 1.426 1.394 1.371 1.354 1.341 1.331 1.285

2.663 2.371 2.183 2.054 1.961 1.891 1.837 1.793 1.757 1.727 1.702 1.680 1.661 1.644 1.629 1.616 1.568 1.536 1.514 1.497 1.485 1.474 1.430

DF

0.01

0.02

0.04

0.06

0.08

0.1

0.15

0.2

0.25

0.3

0.4

0.5

5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 25 30 35 40 45 50 100

1.405 1.203 1.064 0.965 0.889 0.831 0.783 0.743 0.709 0.680 0.654 0.632 0.612 0.594 0.577 0.562 0.503 0.461 0.430 0.405 0.385 0.368 0.283

1.415 1.214 1.077 0.979 0.906 0.848 0.801 0.762 0.729 0.701 0.676 0.654 0.635 0.617 0.601 0.587 0.531 0.491 0.461 0.438 0.420 0.404 0.328

1.436 1.238 1.104 1.008 0.937 0.882 0.837 0.800 0.769 0.742 0.718 0.697 0.679 0.663 0.648 0.634 0.582 0.546 0.519 0.498 0.482 0.468 0.404

1.457 1.261 1.130 1.037 0.970 0.915 0.872 0.836 0.806 0.780 0.758 0.738 0.721 0.705 0.691 0.679 0.629 0.596 0.571 0.552 0.537 0.525 0.467

1.477 1.284 1.156 1.066 1.000 0.948 0.906 0.872 0.843 0.818 0.796 0.778 0.761 0.746 0.732 0.720 0.674 0.642 0.619 0.601 0.587 0.576 0.523

1.497 1.307 1.182 1.095 1.030 0.980 0.939 0.906 0.878 0.854 0.833 0.815 0.799 0.784 0.772 0.760 0.715 0.685 0.663 0.646 0.633 0.622 0.573

1.547 1.365 1.247 1.165 1.104 1.056 1.018 0.987 0.961 0.939 0.919 0.902 0.887 0.874 0.862 0.851 0.810 0.783 0.763 0.748 0.736 0.726 0.683

1.598 1.423 1.310 1.232 1.174 1.129 1.093 1.063 1.038 1.017 0.998 0.982 0.968 0.956 0.944 0.934 0.896 0.870 0.851 0.837 0.826 0.817 0.777

1.649 1.480 1.373 1.297 1.241 1.198 1.163 1.134 1.110 1.090 1.072 1.056 1.043 1.031 1.020 1.010 0.973 0.949 0.931 0.918 0.907 0.899 0.861

1.700 1.537 1.433 1.360 1.305 1.263 1.229 1.201 1.178 1.158 1.141 1.126 1.113 1.101 1.091 1.081 1.046 1.022 1.005 0.992 0.982 0.974 0.938

1.802 1.648 1.548 1.478 1.426 1.385 1.353 1.326 1.303 1.284 1.268 1.253 1.241 1.230 1.220 1.211 1.177 1.154 1.138 1.126 1.116 1.109 1.075

1.902 1.754 1.657 1.589 1.538 1.498 1.466 1.440 1.418 1.399 1.383 1.369 1.357 1.346 1.336 1.328 1.295 1.273 1.257 1.245 1.236 1.229 1.196

DF, degrees of freedom; v 5 [1/n 1 (x0 2 X# )2/(i (xi 2 X# )2]. Values for 95% CIs in upper panel and for 90% CIs in lower panel.

W 5 optimal window width, A 5 min(s,R/1.34), s is the estimate of standard deviation, and n is the number of children. R is the interquartile range, i.e., the difference between third and first quartiles [16]. The point of change can be estimated, for example, by

modeling the data in different cumulative age groups and by finding the best-fitting model, i.e., with greatest R2 (coefficient of determination) and minimal residual mean square. If the rest of the data fit well for linear regression, then the intersection of those regression lines is the point

331

Clinical Chemistry 44, No. 2, 1998

of change. If necessary, the extension to more than two piecewise regression lines is straightforward [13]. If the point where the slope of the regression model changes is known, one can determine an indicator variable that takes this change point into account and a piecewise regression model can be applied. The regression equation is: Y 5 b0 1 b1xi1 1 b2(xi1 2 xc)xi2 1 ei,i 5 1,2, . . . ,n

(11)

where xc is the change point and xi2 is the indicator variable that has a value of 0 or 1 depending on the value of xi1.

aptness of regression model It is important that the statistical procedure includes a check for the regression. The regression models are based on the assumptions that the distribution of residuals is normal and their variance is constant [13]. Hence it may be necessary to make a logarithmic (or other) transformation of the original measurements to fullfill these requirements. In this study the normality of residuals was checked by the Shapiro–Wilk statistic, and graphic analysis of residuals was done to provide information about constancy of residuals. Examination of residuals is important because residual plots allow deviations from linearity to be clearly seen. Formal testing for linearity can also be done, but graphic analysis of residuals is usually sufficient. Diagnostic measures should be used for the detection of problematic observations, i.e., outliers and influential data points. Outliers and influential observations have no general definition, and their meaning varies from one author to another. In this study an observation was considered influential if it had a major influence on the fitted model, i.e., on the values of regression parameters or predicted values. We used Cook’s influence statistic, high leverage points, dffits, and dfbetas statistics to detect influential observations [17–19]. An observation was considered to be an outlier if its dependent variable was either much higher or lower than the dependent variable of other observations with similar independent variables. The studentized residuals were determined for that purpose [20].

All the statistical calculations were done by SAS® for Windows 6.11 package (SAS Institute) and graphic presentation by Microcal OriginTM for Windows (Microcal Software). Values for Table 1 were calculated by MathematicaTM (Wolfram Research).

Results Figure 1 shows that the minimum Hgb value is ;1–2 months, and after that the concentrations start to increase slowly. The greatest changes occur during first 6 months. Because the variability of Hgb values decreased with age, a logarithmic transformation on the Hgb values was applied. In each of the cumulative age groups a linear regression was fitted (see Table 2). The best-fitting models—with greatest R2 and minimal residual mean square—were examined by residual analysis. Residuals (observed value-fitted value) were plotted against fitted values and age. The Shapiro–Wilk statistic was used to evaluate the normality of the distribution of residuals. Outliers and influential data points were identified. A linear regression with the age group from newborns to 2 months was found to fulfill all the above criteria best. A second regression model was fitted to the rest of the data and the aptness of this model was also investigated by the methods described above. The intersection of these two lines occurred at 1.6 months, and this was selected to be the point where the slope of the piecewise linear model changed. Finally, the whole age interval was modeled by a linear piecewise regression method with a change point of 1.6 months. Residuals, outliers, and influential data points also were estimated in this final model. Five outlying observations with studentized residuals greater than the critical value of the Table (a 5 0.10) derived by Lund [20] were detected. However, these values were not removed, as we did not have any specific clinical information that would justify their removal. In Fig. 1 the 2.5% and 97.5% reference limits and their CIs for Hgb values were fitted on the whole age interval by the described methods. In Fig. 2 the age interval is extended only through the first 3 months to better display the higher variability in newborns. Table 3 shows the numeric reference limits with corresponding CIs for different ages and age groups. The

Table 2. Coefficient of determination, root mean square, P-value of Shapiro–Wilk statistic, and sample size in different age groups in logarithmic regression models. Age groups, months

0–1 0–2 0–3 0–4 0–5 0–6 2–12 0–12a a

Piecewise regression.

R2, %

Root mean square

P

n

29.6 65.1 64.2 60.2 43.2 42.4 4.8 70.8

0.132 0.137 0.148 0.157 0.176 0.172 0.081 0.100

0.42 0.49 0.30 0.07 0.26 0.14 0.97 0.31

63 88 104 121 152 170 222 310

332

Virtanen et al.: Reference limits

Reference limits as reviewed by Tietz [21] and limits calculated by our method are also shown in that Table.

Discussion

Fig. 2. Reference limits (2.5% and 97.5%) and 95% CIs of Hgb concentration (g/L) over the age interval 0 –3 months.

exact limits and intervals for a specified age point could be supplied by computer on the laboratory report. Conventional reference limits with CIs are shown in Table 4. These were calculated by the Refval program (from H. E. Solberg, Department of Clinical Chemistry, Rikshospitalet, Oslo, Norway) after division of data into two age groups (0 –1.6 months and 1.6 months–1 year). Table 3. Reference limits (2.5% and 97.5%) and their 95% CIs in different age groups for Hgb concentration (g/L). Age group

0–1 Days 2 Days 3 Days 1–,2 Weeks 2–,3 Weeks 3–,4 Weeks 4–,5 Weeks 5–,6 Weeks 6–,8 Weeks 2–,3 Months 3–,7 Months 7–12 Months a

Lower limita

Upper limita

155 (149–160) 153 (148–158) 151 (146–156) 132 (129–136) 121 (118–124) 111 (109–114) 102 (100–105) 94 (91–97) 92 (89–95) 92 (89–95) 93 (90–95) 95 (94–97)

232 (224–240) 226 (219–234) 224 (217–232) 213 (206–219) 195 (190–201) 180 (175–184) 165 (161–169) 151 (148–155) 139 (135–143) 137 (134–141) 141 (139–144) 147 (143–151)

Our data with regression analysis.

Although regression is frequently used for the estimation of age-specific reference limits, we were not able to find any previous publications where exact CIs for reference limits had been evaluated. Harris and Boyd [22] introduced nonparametric and parametric methods that have been used in estimating and smoothing percentile curves as a function of age. Elveback and Taylor [23] presented some statistical methodology of estimating percentiles and CIs, and they give an equation that corresponds to our approximative CI, i.e., Eq. 10. Royston [24] presented intervals that were approximate and that could be used only if the CI for the mean value of x in the sample were needed. Wright and Royston [25] presented a brief discussion of age-related reference intervals and only mentioned confidence bands that could be calculated from 6 2 (SE). We believe that evaluation of the reliability of regression-based reference limits is just as important as the evaluation of the reliability of conventional reference limits. Determination of the minimum sample size when using regression is not straightforward because there are no power functions available for choosing sample size when reference limits are estimated by this approach. In this case the sample size, as well as the estimated variance, affect the width of the reference interval and CIs. A prerequisite for reference interval estimation is that the reference interval should remain stable, i.e., the width of reference interval should not be dependent on sample size. The method of linear models does not need a large sample size. Some suggestions about its application for reference interval estimation are presented by Royston [24]. CIs become narrower as sample size increases. How precise CIs need to be (and, thus, the optimal sample size) must be estimated by the user. In reference interval estimation, the IFCC proposed that at least 120 observations are needed for reliable estimates [26]. If nonparametric CIs are produced, this is indeed the minimum sample size.

Table 4. Reference limits (2.5% and 97.5%) and their 95% CIs at different age points for Hgb concentrations (g/L). Age group, months

0.5 1.0 2.0 4.0 6.0 7.0 9.0 12.0 a b c

Lower limita

132 111 92 93 95 95 97 99

(128–135) (109–114) (89–95) (91–95) (93–97) (94–97) (95–99) (96–102)

Our data with regression analysis. Our data with Refval program (see text). Tietz [21] (data of confidence limits is missing).

Upper limita

195 (190–201) 165 (161–169) 136 (132–140) 138 (135–141) 140 (138–143) 141 (139–144) 144 (140–147) 147 (143–151)

Lower limitb

Upper limitb

98 (87–108)

224 (213–234)

97 (95–99)

138 (136–141)

Lower limitc

Upper limitc

134 107 94 103 111

198 171 130 141 141

114 113

140 141

333

Clinical Chemistry 44, No. 2, 1998

In Fig. 3, approximate and exact values have been fitted at different degrees of freedom with the function of v. Here, as in Table 1, v has been chosen so that unreliable calculations are prevented, i.e., the maximum value of v is 0.5. One can see that approximate values are near the exact values when the sample size is as small as 20. The difference between the values ranges from 0.06 to 0.08. Thus, if CIs for regression-based reference limits are estimated, approximate values can be used even for relatively small sample sizes. The width of the CIs naturally depends on sample size. In our Hgb data the sample size is 310 and the estimated 97.5% reference limit with corresponding 95% CI at 2 months of age is 136 (132–140). If we assume that we have 40 degrees of freedom (i.e., sample size is 43), the limit and intervals would be 136 (129 –144). At this sample size (43) the exact and approximate CIs are precisely the same. In our Hgb data when the sample size is decreased from 310 to 43, the width of the CI would increase about 7 g/L. Thus it seems that a relatively small sample size is sufficient for the method. This is of great importance, e.g., when considering pediatric reference samples because of the difficulties in the collection of large reference sample groups. A distinct advantage in the use of piecewise regression analysis is that partitioning of the data into several subgroups becomes unnecessary and the sudden changes in reference limits at certain age limits can be avoided. The estimate of the mean of Y is less precise when age is located farther away from its mean value. Thus CIs around the reference limits are wider the farther age is from its mean [13]. However, this effect is only marginal when the estimation of limits and intervals is restricted to the same age interval from which the data originated. Because logarithmic transformation was applied, the upper and lower CIs are not equal. This occurs also in conventional reference interval determination upon retransformation of CIs to original scale. Table 4 shows limits and intervals calculated by our

method and by the Refval program. The derived limits by both methods are close to each other, especially when comparing children approaching the age of 1 year. The greatest differences in reference limits by regression and conventional methods can be seen at ages at which Hgb values show steep changes, i.e., in age groups 0 – 48 days. Further subgrouping with the conventional method is not possible because too few reference subjects would remain in each age group. Without considering the age effect, the lower reference limit with CI would be 98 (87–108) and upper limit 224 (213–234). There are several ways to use regression analysis for the calculation of age-specific reference limits. Nonlinear regression models may be flexible but impractical to use especially over age periods such as newborns, puberty, or menopause, where some underlying physiological change may affect the age dependency. Separate regression models could be applied to specified age intervals, but this would introduce the problem of sudden changes in limits between different age groups [5, 27]. The method used in this study prevents such difficulties. We still strongly recommend visual inspection of data in addition to statistical calculations and measures to check the appropriateness of the model. The proposed method is practical and could be used for a variety of laboratory analytes that show dependency on age or other known characteristics. We recommend it be used in clinical laboratories to improve the quality of age-dependent reference limits. The necessary calculations and display of data can be done with microcomputers and basic statistical software.

Appendix exact cis for reference limits The exact CIs (Eq. 9) satisfy the following equation: ˆ 0 1 qS) 5 1 2 a ˆ 0 2 qS # Q0 # Q Pr(Q ˆ 0 and making Inserting the expressions for Q0 and Q rearragements gives Pr{[Za 2 (Zaan 2 p 1 t)(S/s)]/Ïv # (Yˆ0 2 b0 2 b1x0)/sÏv # [Za 2 (Zaan 2 p 2 t)(S/s) ]/Ïv} 5 1 2 a from which the following formula can be derived: F(q) 2 F( 2 q) 5 1 2 a where F(t) 5

E

`

F@~t 2 an2p Za! ~ Î~u/n 2 p!#/Ïv

0

1 Za/Ïv)fx2(n 2 p) ~u! du Fig. 3. Exact (solid lines) and approximate (dotted lines) values needed when 95% CIs are calculated.

5 F(t ; Za, n 2 p, Ïv) Some values for this distribution are given in Table 1.

334

Virtanen et al.: Reference limits

example In our Hgb data after a logarithmic transformation was applied, S250.0100. If 97.5% reference limit and 95% CIs are evaluated at 2 months of age, the following calculations are needed: an 2 p 5 În 2 p)/n 2 p 2 0.5) 5 1.0008 Yˆ0 5 bˆ 0 1 bˆ 1x0 2 bˆ 2x, where bˆ 2 5 0 since the indicator variable has a value of zero. 5 (4.713 1 0.007x0) 5 4.717, which is the mean value of Hgb at given age point ˆ 5 bˆ 0 1 b1x0 1 Zaan2pS Q 5 4.913, which is the upper 95% reference limit at age 2 months. CIu 5 (bˆ 0 1 bˆ 1x0 1 Zaan2pS) 1 Za9an2p=[v 1 Z2a(an2p2 2 1)]S 5 4.940, which is the upper 95% CI for the upper reference limit. Value of v is 0.0135. CI1 5 (bˆ 0 1 bˆ 1x0 1 Zaan2pS) 2 Za9an2p=[v 1 Z2a(an2p2 2 1)]S 5 4.885, which is the lower 95% CI for the upper reference limit. Because our sample size is .300, an approximate value can be used, and it is 1.96(1.0008)(0.1406) 5 0.2758. Hence, after retransformation the 97.5% reference limit at 2 months of age with 95% CI is 136 (132–140).

sas program This program calculates 2.5% and 97.5% reference limits and 95% CIs in piecewise linear regression. options linesize575 pagesize556 nodate; data limits; n-310; /*sample size*/ nn5(1/n); df5n23; /*degrees of freedom*/ x1m55.3384357048; /*mean value of age in the sample*/ x2m520.260042289; /*mean value of indicator variable*/ x1150.0004267694; /*value of x11 in inverse of X9X*/ x12520.002225561; /*value of x12 in inverse of X9X*/ x21520.002225561; /*value of x21 in inverse of X9X*/ x2250.024588304; /*value of x22 in inverse of X9X*/ s50.10013; /*root mean square error*/ z51.96; /*fractile needed to calculate 95%*/ z25z**2; /*reference and confidence intervals*/ b054.703196; /*intercept term*/ b150.007488; /*point estimate ofb1 (age)*/ b2520.347497; /*point estimate of b2 (indicator variable)*/ do age50.00 to 12.000 by 0.001; /*age interval*/ if age gt 1.571 then x150; /*point of change in*/ else x151; /*our data is 1.571*/ iv5(age21.571)*x1; /*indicator variable*/ diff15age2x1m; diff25iv2x2m;

m15(diff1*x11)1(diff2*x21); m25(diff1*x12)1(diff2*x22); part15m1*diff1; part25m2*diff2; all5part11part2; v5nn1all an5sqrt(df/(df20.5)); an25an*an; vza5v1(z2*(an221)); svza5sqrt(vza); zas5z*an*s; pred5exp(b01(b1*age)1(b2*iv)); Q25(b01(b1*age)1(b2*iv)1(zas)); /*upper ref. limit ln*/ Q2a5exp(Q2); /*upper ref. limit in original unit*/ Q15(b01(b1*age)1(b2*iv)2(zas)); /*lower ref. limit ln*/ Q1a5exp(Q1); /*lower ref. limit in original unit*/ former5zas*svza; cilrl5exp(Q12former); /*lower confidence interval and*/ cilru5exp(Q11former); /*upper for lower reference limit*/ ciurl5expQ22former); /*lower confidence interval and*/ ciuru5exp(Q21former); /*upper for upper reference limit*/ output; end; run;

References 1. Solberg HE, Gra¨sbeck R. Reference values. Adv Clin Chem 1989; 27:2–79. 2. NCCLS. Document C28-A. How to define reference intervals in the clinical laboratory; approved guideline. Wayne, PA: NCCLS, 1995: 1–59. 3. Harris EK, Boyd JC. On dividing reference data into subgroups to produce separate reference ranges. Clin Chem 1990;36:265–70. 4. Irjala K, Koskinen P, Icen A, Palosuo T. Refrerence intervals for immunoglobulins IgA, IgG and IgM in serum in adults and in children aged 6 months to 14 years. Scand J Clin Lab Invest 1990;50:573–7. 5. Anderson JR, Strickland D, Corbin D, Byrnes J, Zweiback E. Age-specific reference ranges for serum prostate-specific antigen. Urology 1995;46:54 –7. 6. Burritt MF, Slockbower JM, Forsman RW, Offord KP, Bergstralh EJ, Smithson WA. Pediatric reference intervals for 19 biologic variables in healthy children. Mayo Clin Proc 1990;65:329 –36. 7. Vicente C, Porto G, De Sousa M. Method for establishing serum ferritin reference values depending on sex and age. J Lab Clin Med 1990;779 – 84. 8. Gallo C, Turco MC, Sanzi G, La Rocca S, Sacchetti L, Salvatore F. Application of a statistical approach to determining reference intervals in clinical chemistry. Clin Chem 1985;31:1367–70. 9. Kairisto V, Ha¨nninen KP, Leino A, Pulkki K, Peltola O, Na¨nto¨ V, et al. Generation of reference values for cardiac enzymes from hospital admission laboratory data. Eur J Clin Chem Clin Biochem 1994;32:789 –96. 10. Gonchoroff DG, Branum EL, Cedel SL, Riggs BL, O’Brien JF. Clinical evaluation of high-performance affinity chromatography for the separation of bone and liver alkaline phosphatase isoenzymes. Clin Chim Acta 1991;199:43–50. 11. Dallman PR, Siimes MA. Percentile curves for hemoglobin and red cell volume in infancy and childhood. J Pediatr 1979;94:26 –31. 12. Na¨nto¨ V, Salmi TT, Kairisto V, Kouri T, Rajama¨ki A, Uusipaikka E,

Clinical Chemistry 44, No. 2, 1998

13. 14. 15.

16. 17. 18. 19.

Virtanen A. Pediatric reference values for hematologic parameters from diagnosis-selected hospital patient population. IX European Congress Of Clinical Chemistry, Krakow, Poland, September 8 –14, 1991. Neter J, Wasserman W. Applied linear statistical models. Homewood, IL: Richard D. Irwin, 1974:30, 73, 97–111, 313–5. Kendall M, Stuart A. The advanced theory of statistics, Vol. 1. Distribution theory. London: Charles Griffin & Co., 1977:399. Hahn GJ. Statistical intervals for a normal population, part I. Tables, examples and applications. J Qual Technol 1970;2: 115–25. Silverman BW. Density estimation for statistics and data analysis. New York: Chapmann and Hall, 1986:47– 8. Freund RJ, Littell RC, Spector PC. SAS system for linear models. Cary, NC: SAS Institute, 1986:43. Cook RD, Weisberg S. Residuals and influence in regression. New York: Chapman and Hall, 1982:10 – 8, 116 –26. Cook RD. Influential observations in linear regression. J Am Stat Assoc 1979;74:169 –74.

335

20. Lund RE. Tables for an approximate test for outliers in linear models. Technometrics 1975;17:473– 6. 21. Tietz NW. Clinical guide to laboratory tests. Philadelphia: W.B. Saunders, 1990:284. 22. Harris EK, Boyd J. Statistical bases of reference values in laboratory medicine. New York: Marcel Dekker, 1995:151–73. 23. Elveback LR, Taylor WF. Statistical methods of estimating percentiles. Ann N Y Acad Sci 1969;161:538 – 48. 24. Royston P. Constructing time-specific reference ranges. Stat Med 1991;10:675–90. 25. Wright EM, Royston P. A comparison of statistical methods for age-related reference intervals. J R Stat Soc 1997;160:47– 69. 26. IFCC. Approved recommendation (1987) on theory of reference values. Part 5. Statistical treatment of collected reference values. Determination of reference limits. J Clin Chem Clin Biochem 1987;25:645–56. 27. Wyatt DT, Nelson D, Hillman RE. Age-dependent changes in thiamin concentrations in whole blood and cerebrospinal fluid in infants and children. Am J Clin Nutr 1991;53:530 – 6.

Suggest Documents