Editorials
Clinical Chemistry 58:5 808–810 (2012)
Establishing Pediatric Reference Intervals: A Challenging Task Ferruccio Ceriotti*
Defining pediatric reference intervals (RIs) is one of the most difficult tasks in our profession. The continuously changing physiology of growing children makes their laboratory values a moving target. For specific analyte measurement in pediatric age, one has to consider the interindividual variation not only of the homeostatic point, but also of the timing with which the development of the individual occurs. In this sense, the first year of life and puberty are the most critical periods. In addition, ethnic and behavioral differences may also cause variations. The only way to deal with such complexity is to gather large populations of reference individuals; however, the ethical problems involved in drawing blood from a healthy newborn or child further complicate this already demanding effort. Therefore, although one of the seminal reports on RIs was published in 1960 by pediatrician Albert John Schneider (1 ), it is not surprising that very few data on pediatric RIs exist. It is against this rather dark backdrop that the work of Colantonio and coworkers (2 ) described in this issue of the Journal sheds new light; however, a careful evaluation of this study is necessary to fully understand its relevance. Ceriotti et al. (3 ) have listed 4 criteria that must be considered in the appraisal of any RI study: (a) the selection of reference individuals; (b) the study design; (c) the analytical quality of the data; and (d) the statistical treatment of the data. The selection of reference individuals represents one of the major strengths of the study of Colantonio et al. These investigators applied an “a priori” selection scheme by enrolling volunteers or using leftover samples from well-characterized healthy newborns or from select outpatient clinics (for dentistry, bone fractures, plastic surgery). Such an approach is particularly relevant because, as indicated by the IFCC and by CLSI approved guideline C28-A3 (4, 5 ), it permits the characteristics of the reference individuals to be clearly defined. Owing to the difficulties of enrolling a sufficient
Diagnostica e Ricerca San Raffaele, Istituto Scientifico Universitario San Raffaele, Milano, Italy. * Address correspondence to the author at: Diagnostica e Ricerca San Raffaele, Via Olgettina 60, 20123 Milano, Italy. Fax ⫹39-02-26432640; e-mail ceriotti.
[email protected]. Received February 13, 2012; accepted February 16, 2012. Previously published online at DOI: 10.1373/clinchem.2012.183483
808
number of individuals, many authors in the pediatric field have advocated the use of the so-called datamining or indirect-sampling approaches. These techniques, initially popularized by Hoffmann (6 ), are based on the assumption (subsequently confirmed by observation) that the majority of laboratory results are “normal.” Statistical calculations based on an underlying presumed distribution of the laboratory results are used to extract the healthy individuals from the study population; however, such approaches are fraught with problems. First, there is no certainty that the assumed underlying distribution is correct, especially when one considers that it is skewed in the majority of cases. Second, these methods provide no control of preanalytical variables (fasting status, type of tubes, and timing before centrifugation and/or analysis) and no control of analytical variables (calibration status, quality control, and so forth). Finally, the results of studies that have attempted to validate such methods have been disappointing (7, 8 ). These methods cannot, therefore, be endorsed to establish new RIs. With regard to study design, Colantonio and coworkers (2 ) applied uniform criteria for sample procurement. They went to great lengths to recruit a sufficient number of individuals (⬎2000 children evenly divided between males and females), particularly in the younger age groups ⬍1 year of age, for which there were about 1000 individuals. With respect to the analytical quality of their data, Colantonio et al. reported all information regarding the analytical quality of the methods in their Supplemental Table 1. Although the characteristics and performance of the analyzer used are clearly described, the information on metrologic traceability is only indirect. They relied completely on the manufacturer’s indications and did not provide any direct confirmation of the traceability of the obtained data. It is important to stress the relevance of analytical quality, because the lack of standardization adds an additional layer to the already complex situation of RIs that vary according to age, sex, and ethnic group. Two aspects need to be considered: the metrologic traceability and the endorsement of non–state-of-the-art methods. Demonstrating traceability via comparison with higher-order methods or materials (9, 10 ) gives universal credibility and value to laboratory test results, independent of the analyzer or methods used. Any laboratory using a trace-
Editorials able method can adopt the intervals developed elsewhere for a similar traceable method as long as there are no biological differences in the underlying populations that would lead to different RIs. Colantonio and coworkers plan to carry out transferability experiments to provide pediatric RIs for other analytical systems; however, this activity would not be necessary if they could have demonstrated the accuracy of their results by comparing them with commutable reference materials or with reference methods. In 1994, Norbert Tietz published an opinion article entitled, “Accuracy in Clinical Chemistry—Does Anybody Care?” (11 ). His plea seems to have been ignored if, 18 years later, we are still proposing method-dependent RIs for common clinical chemistry analytes. Standardization and harmonization are not just the fixation of some foolish metrologists. They represent ethical issues, as Bossuyt et al. have pointed out (12 ), and are very necessary to improve patient safety and clinical effectiveness (13 ). A related problem of the Colantonio et al. study, in my opinion, is its seeming endorsement of non–stateof-the-art methods. To make progress in our discipline, we should promote only the best analytical principles. It is well known that changing habits and traditions takes time, and if we allow old-fashioned methods to survive (and developing a well-defined RI is certainly a good way of doing that), it will be difficult to convince people to abandon them. This consideration applies in particular to 3 analytes: creatinine, albumin, and transaminases. For creatinine, a recent report (14 ) showed that enzymatic methods, although not perfect, are clearly superior to Jaffe methods. The superiority of enzymatic methods is particularly true for pediatric values, for which nonspecific interferences may account for ⬎50% of the signal, as is shown for the younger age groups (15 days to 4 years) in Table 1 of the Colantonio et al. report (2 ). For this reason, the IFCC Committee on Reference Intervals and Decision Limits decided to endorse only creatinine RIs obtained with a traceable enzymatic method (3 ). Not surprisingly, the enzymatic creatinine results of Colantonio et al. (2 ) are very similar to those proposed by the IFCC committee. This similarity in the results supports the concept that with proper analytical standardization— and in the absence of ethnicity-related modification— common RIs can be applied. For albumin, the data presented show large differences between RIs obtained via the 2 method principles (bromcresol green and bromcresol purple). Both methods are well known to be poorly specific (15 ), and the claimed traceability of these methods to the certified reference material ERMDA470 is useless, because these methods do not measure albumin specifically, thus causing a break in the traceability chain. Albumin is a relevant parameter for various clinical presentations, ranging from nephrotic
syndrome to nutritional assessment, especially in pediatrics, but it is also widely used to recalculate results for total calcium. Endorsement of such nonspecific measurement methods when more specific immunochemical approaches exist is, in my opinion, inappropriate. A third note concerns transaminases. Here, too, the IFCC is struggling to promote standardization, and nontraceable methods, such as those without pyridoxal phosphate, should be abandoned. Colantonio et al. (2 ) provided additional evidence for this fact in that lower alanine aminotransferase RIs for East Asian and Southeast Asian people occurred only with assays without added pyridoxal phosphate, thus suggesting that the lower values might be related to lower vitamin B6 concentrations in these populations. Such data further support the use of optimized methods only. In their statistical treatment of the data, Colantonio et al. applied the correct statistical approaches and conducted a careful statistical analysis of their data. Some may question their proposal of basing their definition of sex-related RIs only on a statistical criterion. For instance, the investigators used this criterion in proposing a sex-related RI for direct bilirubin (age group, 13–19 years), but the width of the 90% CI of the upper reference limit for girls is approximately 40% of the entire interval. According to CLSI guideline C28-A3 (5 ), an uncertainty this high (a CI ⬎20%) points to the need for a larger reference population. An evaluation of the clinical relevance of the differences between the groups might also have reduced the number of analytes recommended for partitioning when such partitioning was based only on a statistical criterion. For analytes in which the RI changes continuously with age, such as creatinine, inorganic phosphate, and alkaline phosphatase, some of the recommendations regarding partitioning also appear debatable. Colantonio and coworkers are to be commended, however, for making their entire database available online to readers, thus giving everyone an opportunity to analyze the data differently or to use moresophisticated statistical techniques. In particular, I am considering the use of fractional polynomials and exponential transformation (3, 16 ) to derive age-specific RIs for all of the analytes that show continuous agerelated changes. Setting aside all the issues related to assay standardization, the study of Colantonio and coworkers (2 ) is a milestone that represents a quantum leap forward in helping close gaps in pediatric RIs, especially for the first year of life, and even more so for the first few days of life. Given the large size and careful execution of this study, it surely represents a landmark achievement in pediatric clinical chemistry and provides results that will serve the field well into the future. Clinical Chemistry 58:5 (2012) 809
Editorials
Author Contributions: All authors confirmed they have contributed to the intellectual content of this paper and have met the following 3 requirements: (a) significant contributions to the conception and design, acquisition of data, or analysis and interpretation of data; (b) drafting or revising the article for intellectual content; and (c) final approval of the published article. Authors’ Disclosures or Potential Conflicts of Interest: Upon manuscript submission, all authors completed the Disclosures of Potential Conflict of Interest form. Potential conflicts of interest: Employment or Leadership: None declared. Consultant or Advisory Role: F. Ceriotti, Siemens Italia. Stock Ownership: None declared. Honoraria: None declared. Research Funding: None declared. Expert Testimony: None declared.
References
5.
6. 7. 8. 9.
10. 11. 12. 13.
1. Schneider AJ. Some thoughts on normal, or standard, values in clinical medicine. Pediatrics 1960;26:973– 84. 2. Colantonio DA, Kyriakopoulou L, Chan MK, Daly CH, Brinc D, Venner AA, et al. Closing the gaps in pediatric laboratory reference intervals: a CALIPER database of 40 biochemical markers in a healthy and multiethnic population of children. Clin Chem 2012;58:854 – 68. 3. Ceriotti F, Boyd JC, Klein G, Henny J, Queralto´ J, Kairisto V, Panteghini M, on behalf of the IFCC Committee on Reference Intervals and Decision Limits (C-RIDL). Reference intervals for serum creatinine concentrations: assessment of available data for global application. Clin Chem 2008;54:559 – 66. 4. Petitclerc C, Solberg HE, International Federation of Clinical Chemistry (IFCC), Scientific Committee, Clinical Section, Expert Panel on Theory of Reference Values (EPTRV). Approved recommendation (1987) on the theory
810 Clinical Chemistry 58:5 (2012)
14.
15.
16.
of reference values. Part 2. Selection of individuals for the production of reference values. J Clin Chem Clin Biochem 1987;25:639 – 44. CLSI. Defining, establishing, and verifying reference intervals in the clinical laboratory; approved guideline—third edition. CLSI document C28-A3. Wayne (PA): CLSI; 2008. Hoffmann HG. Statistics in the practice of medicine. JAMA 1963;185: 864 –73. Amador E, Hsi BP. Indirect methods for estimating the normal range. Am J Clin Pathol 1969;52:538 – 46. Elveback LR, Guillier CL, Keating FR Jr. Health, normality, and the ghost of Gauss. JAMA 1970;211:69 –75. International Organization for Standardization (ISO). ISO 17511:2003. In vitro diagnostic medical devices – measurement of quantities in biological samples – metrological traceability of values assigned to calibrators and control materials. Geneva: ISO; 2003. Panteghini M. Traceability as a unique tool to improve standardization in laboratory medicine. Clin Biochem 2009;42:236 – 40. Tietz NW. Accuracy in clinical chemistry— does anybody care? Clin Chem 1994;40:859 – 61. Bossuyt X, Louche C, Wiik A. Standardisation in clinical laboratory medicine: an ethical reflection. Ann Rheum Dis 2008;67:1061–3. McLawhon RW. Patient safety and clinical effectiveness as imperatives for achieving harmonization inside and outside the clinical laboratory. Clin Chem 2011;57:936 – 8. Greenberg N, Roberts WL, Bachmann LM, Wright EC, Dalton RN, Zakowski JJ, Miller WG. Specificity characteristics of 7 commercial creatinine measurement procedures by enzymatic and Jaffe method principles. Clin Chem 2012;58:391– 401. Xu Y, Wang L, Wang J, Liang H, Jiang X. Serum globulins contribute to the discrepancies observed between the bromocresol green and bromocresol purple assays of serum albumin concentration. Br J Biomed Sci 2011;68: 120 –5. Royston P, Wright EM. A method for estimating age-specific reference intervals (normal ranges) using fractional polynomials and exponential transformation. J R Stat Soc Ser A Stat Soc 1998;161:79 –101.