Predictors of the Outcome of Surgical Treatment in Acromegaly and ...

6 downloads 0 Views 245KB Size Report
References. 1. Alexander L, Appleton D, Hall R, Ross WM, Wilkinson R. 1980 Epidemi- ology of acromegaly ... Ulster Med J. 59:55– 62. 3. Etxabe J, Gaztambide ...
0021-972X/01/$03.00/0 The Journal of Clinical Endocrinology & Metabolism Copyright © 2001 by The Endocrine Society

Vol. 86, No. 4 Printed in U.S.A.

Predictors of the Outcome of Surgical Treatment in Acromegaly and the Value of the Mean Growth Hormone Day Curve in Assessing Postoperative Disease Activity G. A. KALTSAS*, A. M. ISIDORI*, D. FLORAKIS, P. J. TRAINER, C. CAMACHO-HUBNER, F. AFSHAR, I. SABIN, J. P. JENKINS, S. L. CHEW, J. P. MONSON, G. M. BESSER, AND A. B. GROSSMAN Departments of Endocrinology (G.A.K., A.M.I., D.F., P.J.T., C.C.-H., J.P.J., S.L.C., J.P.M., G.M.B., A.B.G.) and Neurosurgery (F.A., I.S.), St. Bartholomew’s and the Royal London Hospitals, London ECIA 7BE, United Kingdom ABSTRACT Acromegaly is associated with increased morbidity and mortality unless serum GH levels are persistently less than 5 mU/L (⬃2 ng/mL) after treatment. Transsphenoidal surgical resection is the best available treatment for restoring GH to such “safe” levels; however, criteria for the assessment of the response to treatment are not uniform. To determine the clinically most useful method of assessing disease activity postoperatively and identify predictors of a favorable response to surgical treatment, we have analyzed 67 patients with acromegaly who underwent transsphenoidal surgery between 1993 and 1998. We used three different definitions of a satisfactory or safe response: 1) a postoperative mean GH less than 5 mU/L obtained from averaging five serum GH values obtained throughout one day; 2) a random single GH less than 5 mU/L; or 3) a serum insulin-like growth factor I (IGF-I) level within the normal range. Relying on a single GH measurement alone, 9 of the 23 patients with a single postoperative mean GH level less than 5 mU/L obtained at least one GH value of more than 5 mU/L (false positive rate, 28%) and 8 of the patients with a postoperative mean GH value of more than 5 mU/L obtained a single GH value of less than 5 mU/L (false negative rate, 15%). Postoperatively, a significant increase in the fluctuation of random GH values around the mean was observed in patients who were rendered safe (coefficient of variation, from 26 ⫾ 2% to 53 ⫾ 6%; P ⬍ 0.001) compared with patients with persistence of inadequately controlled disease. However, 13% (3 of 23) of patients with mean postoperative GH levels of less than 5 mU/L had elevated serum IGF-I levels postoperatively, and 17% (8 of 44) of patients with mean serum GH levels more than

A

CROMEGALY HAS AN estimated prevalence of 40 – 60 cases per million and an annual incidence rate of 3– 4 cases per million (1–3); untreated acromegaly is associated with increased morbidity and an age-corrected mortality of between 1.9- and 3.3-fold (4 – 6). This excess mortality is eliminated in patients whose treatment succeeds in producing a mean GH level of less than 5 mU/L (2 ng/mL) and a normal serum insulin-like growth factor I (IGF-I); Refs. 5–7). A GHsecreting adenoma is identified in more than 99% of cases and is responsible for both local and systemic effects (8); thus, it has been suggested that the aim of the treatment in acro-

Received September 13, 2000. Revision received December 18, 2000. Accepted December 28, 2000. Address correspondence and requests for reprints to: Prof. A. B. Grossman, Department of Endocrinology, St. Bartholomew’s Hospital, London ECIA 7BE, United Kingdom. E-mail: [email protected]. * These authors contributed equally to this work.

5 mU/L had postoperative IGF-I levels within the normal range. There was no difference in the rate of agreement between mean GH less than 5 mU/L and normalization of IGF-I in relation to the interval since operation when IGF-I levels were measured. Preoperative tumor size and pretreatment mean GH levels were the major determinants of the outcome of surgery, as patients who were rendered safe had significantly lower preoperative mean GH levels than patients who were not cured (median, 31 mU/L vs. 78.5 mU/L, P ⬍ 0.01). IGF-I levels were weakly correlated with tumor size and could not be used to predict the patients who would be rendered safe. Preoperative PRL levels were higher in patients who failed to achieve a surgical satisfactory outcome [498 mU/L (187– 857) vs. 196 mU/L (136 –315), P ⬍ 0.01]. In summary, although single random GH values and IGF-I values are both significantly correlated with mean GH levels, they should not be used as an alternative to averaging several GH values to assess disease activity, because of the pulsatile nature of GH secretion and the multiple factors that may influence serum IGF-I. Because significant discrepancies occur, particularly postoperatively, mean GH levels remain the more reliable indicator of surgical outcome and disease activity. As there is considerably more evidence relating longterm prognosis to serum GH levels than to IGF-I and discrepancies occur between GH levels and IGF-I, we suggest that mean serum GH levels and single IGF-I levels, measured early in the postoperative period, are currently the best biochemical guide to the adequacy of surgery and, hence, the need for further treatment. (J Clin Endocrinol Metab 86: 1645–1652, 2001)

megaly should be reduction of tumor bulk and its neighborhood effects, reversal of systemic effects, a mean GH level of less than 5 mU/L (2 ng/mL), and normal age-related serum IGF-I. However, none of the currently available modalities of treatment can consistently fulfil all these aims when used alone (9). Whereas transsphenoidal surgery (TSS) is currently the optimal therapy in terms of the achievement of immediate remission and mean GH levels less than 5 mU/L in a significant number of patients, the majority will require further treatment to maintain GH levels in or near the desirable range (10, 11). Although radiotherapy has been used as an adjunct to surgical treatment, it results in a relatively slow fall of GH levels over the subsequent years, and interim medical treatment with somatostatin analogs, dopamine agonists, or a GH receptor antagonist is usually needed (12). It is, therefore, important to establish reliable predictors of the outcome

1645

1646

JCE & M • 2001 Vol. 86 • No. 4

KALTSAS ET AL.

of TSS and identify pre- and postoperatively who will most probably require further intensive treatment. A number of factors have been suggested as being useful predictors of the success of surgery: these include tumor size (13), preoperative basal GH levels (14), and impaired pituitary reserve, all of which are associated with poor outcome and the need for further treatment after surgery. However, in some of these studies, single random or basal GH values were used as an indicator of disease activity and the response to surgery (13–18). Although a single random serum GH sample is easier to obtain in practice, normalized mortality has been described only with mean GH levels less than 5 mU/L from multiple sampling throughout the day (5, 6). Over the last decade, estimation of IGF-I as a marker of integrated GH secretion has been suggested by some to be of greater validity than immunoreactive GH levels in the assessment of outcome, in that it should reflect overall GH bioactivity (13, 19 –24). However, we have little data on mortality related to IGF-I (7). Although a number of studies evaluated serum IGF levels in the postoperative follow-up in acromegaly, most of these studies used different criteria to assess residual disease, mainly random GH values or the nadir during glucose suppression, as well as a wide range of different cut-off values (1–5 ␮g/L). Furthermore, in most of these studies the normal reference range for IGF-I values was not age adjusted. Lack of homogeneity in these studies led to contradictory results in the rate of concordance between postoperative IGF-I and GH criteria for remission (25). In general, the most common discrepancy is the finding of ‘normal’ GH with elevated IGF-I levels; less frequent is the opposite finding of elevated GH with normal IGF levels (14, 15, 18, 19, 26 –30). Recent data provided evidence that the extent of GH suppression with oral glucose should be reconsidered, because more than 50% of patients with active disease (elevated IGF-I) have a nadir GH value less than 1 ␮g/L when measured with enhanced sensitivity assays (27). On the contrary, very few data have compared pre- and postoperative changes in mean GH levels with IGF-I using as cut-off a mean GH less than 5 mU/L, which is the only value significantly associated with normalized mortality (7, 19, 24, 26). Finally, many factors other than GH—including nutritional state, liver function, serum protease activity, IGF-I-binding proteins, and sex hormones— contribute to the determination of serum IGF-I, and particularly in the adult IGF-I levels may not be directly related to GH (26, 31, 32). Thus, for example, in the adult GH deficiency syndrome at least 30% of patients have normal serum IGF-I levels (33). We have now evaluated 67 patients with newly diagnosed acromegaly who underwent TSS between 1993 and 1998 to determine: 1) preoperative factors, including IGF-I measurements, that might predict surgical outcome; 2) whether a single random GH measurement is as useful a predictor as a mean GH value in determining disease activity pre- and postoperatively; 3) the interval time for the best postoperative assessment defined as the highest concordance rate between IGF-I levels and mean GH levels; and, thus, 4) the optimal biochemical strategy for defining the response to surgery.

Subjects and Methods From 1993 onward 71 patients with newly diagnosed acromegaly [31 females; median age, 46 yr (range, 12– 80)] underwent TSS; sixty-seven were fully investigated before and after TSS to determine the immediate outcome, the preoperative features associated with a good result, and the accuracy of postoperative testing in predicting the long-term outcome. Thirty patients who underwent TSS between 1993 and 1995 were included in a previous study in which, however, neither serum IGF-I levels nor the correlation between mean GH and random GH values was reported (34). All patients were assessed preoperatively in our center according to a consistent protocol (35). The diagnosis of acromegaly was based on the presence of relevant clinical features and confirmed by demonstration of failure of plasma GH to suppress to levels less than 1 mU/L (1 mU/L ⫽ 0.4 ng/mL) during an 75-g oral glucose tolerance test (OGTT; Refs. 27 and 36 –38). GH secretion was assessed by calculating the mean serum GH from a five-point GH “day curve,” samples being collected at 0830 h, 1100 h, 1300 h, 1700 h, and 1900 h, and basal serum IGF-I measurements. Non-GH-related pituitary function (serum T4, TSH, 0900 h cortisol, PRL, LH, FSH, testosterone, or estradiol) was assessed routinely pre- and postoperatively. The pituitary-adrenal axis was also assessed dynamically by examining the cortisol response to hypoglycemia during an insulin tolerance test, or during a glucagon test. All patients underwent preoperative imaging of the pituitary fossa with either computed tomography or magnetic resonance scanning. Pituitary adenomas were identified in all cases and were classified into microadenomas (intrasellar tumors of ⬍1 cm diameter), mesoadenomas (tumors ⬎1 cm diameter that do not touch the optic chiasm or invade the sphenoidal or cavernous sinuses), or macroadenomas (tumors ⬎1 cm in diameter with suprasellar, inferior, or lateral extension compressing the surrounding structures). TSS was performed by two surgeons (F.A. and I.S.), via a sublabial or transnasal approach, using an image intensifier. Obvious tumor was removed with attempted preservation of apparently normal pituitary tissue. After removal of the tumor, routine histopathological analysis was performed to establish the pattern of reticulin staining plus routine immunostaining for all pituitary hormones. At 7 days postoperatively, serum GH was measured during a GH day curve, as described previously (34). Serum IGF-I levels were measured at least 1 week after surgery; 65% of patients had IGF-I measured 1– 4 weeks after surgery (median, 7 days; 25th–75th percentiles, 6 –10), the remaining 35% later than 4 weeks (median, 90 days, 30 –180). When pituitary deficiency was recorded it was adequately replaced. To assess whether a single random GH measurement provides as accurate an estimate of tissue exposure to GH as the mean GH value obtained from a GH day curve, we correlated each single basal serum GH level obtained from the five-point GH day curve, pre- and postoperatively. We also evaluated pre- and postoperatively the percentage distance of the minimum and the maximum of the set from the mean of the day, and the coefficient of variation in all patients. Furthermore, we analyzed the differences in these indices of fluctuation of GH levels in groups according to surgical outcome. Mean GH levels (based on a five-point day curve) were also obtained from 20 healthy volunteer subjects (age, 23.4 ⫾ 1.8; body mass index, 22.9 ⫾ 1.8; M/F ⫽ 12/8).

Assays Serum GH was measured by immunoradiometric assay with reagents supplied by North East Thames Regional Radiommunoassay Laboratory (London, UK). The functional sensitivity is less than 0.5 mU/L with inter- and intra-assay coefficients of variation of less than 5%. The assay is monitored by the United Kingdom National External Quality Control Scheme. Serum IGF-I was measured using an in-house RIA after formicacid acetone extraction, as described previously (39).

Immunostaining of pituitary hormones Each pituitary tissue sample was stained with hematoxylin and eosin as a general stain to establish the various tissue components. A reticulin stain was performed to provide an indication of tissue architecture. Pituitary hormone stains were performed to establish the hormone phenotype of the tumor samples, as described previously (40, 41).

MEAN GH LEVELS AND OUTCOME OF SURGERY IN ACROMEGALY Statistical analysis All data are presented as median values accompanied by 25th and 75th percentiles, or else as mean ⫾ se. The daily fluctuation of serum GH levels was assessed pre- and postoperatively by calculating the percentage distance of peak and nadir from the mean of a five-point day curve (2-hourly sampling) and the coefficient of variation. Nonparametric tests were used in comparing groups: the Mann-Whitney U test and its generalization by Kruskal and Wallis for comparisons between independent groups; the Wilcoxon matched-pairs signed-rank sum test was used to compare within subjects data before and after TSS (42). The correlation between variables was described using Spearman’s rank correlation coefficients. Logistic regression was used to analyze the relationship between cure after TSS (based on mean serum GH) and clinical variables. To analyze the overall influence of predictors of persistence, a multivariate logistic regression analysis was performed (43). Results were considered statistically significant if the two-tailed P was less than 0.05.

1647

tient had panhypopituitarism; another 22 patients had gonadotropin deficiency whereas 4 patients had ACTH deficiency alone. Twenty-nine patients with pituitary deficiency had macroadenomas, four mesoadenomas, and four microadenomas on pituitary imaging. Outcome of TSS

We have subdivided our patients’ “response” to surgery according to the mean postoperative GH levels. The preoperative levels of mean GH (median, 71.4; 25th–75th percentiles, 26.4 –113.3), IGF-I (829, 599 –1024), and PRL (275, 174 – 710) were significantly lowered by TSS surgery with an 88% reduction in the mean GH (median, 8.8; 25th–75th percentiles, 4.6 –18; P ⬍ 0.01), a 52% reduction in IGF-I (400, 265– 643; P ⬍ 0.01), and a 50% reduction in PRL levels (139, 92–207;

Results Preoperative imaging

In 19 patients this was by computed tomography, and in 48 patients by magnetic resonance imaging scanning of the pituitary fossa. Macroadenomas were found in 42 patients (63%), and mesoadenomas in 8 patients (12%); patients with macroadenomas had significantly higher levels of GH compared with the 17 patients (25%) with microadenomas (Table 1). In patients with macroadenomas the mean age at presentation was significantly lower than in patients with microadenomas (41.0 ⫾ 14.3 vs. 49.8 ⫾ 10.9 yr, P ⬍ 0.05). There was no difference in gender, body mass index, or apparent duration of disease among patients with differently sized pituitary tumors. Preoperative evaluation of pituitary function

There was an overall significant correlation between each of the five single random GH measurements obtained from the GH day curve and the mean GH value (r ⫽ ⫹0.93 to 0.97, P ⬍ 0.0001), as well as between the maximum and the minimum value (r ⫽ ⫹0.94 to 0.98, P ⬍ 0.0001). However, within subjects there was a clear fluctuation of the single GH values around the mean, with a mean percentage distance from the maximum of 33 ⫾ 3% (range, 3–131) and from the minimum of 30 ⫾ 2% (range, 4 – 81; Fig. 1A). Preoperatively, no statistically significant difference in the percentage distance from the mean was found when patients were grouped according to the outcome of surgery. There was a weak but significant correlation between log mean GH and IGF-I levels (r ⫽ ⫹0.48, P ⬍ 0.01; Fig. 2), but there were large discrepancies in individual values in many patients. Thirty-seven patients in total were found to have at least one pituitary hormonal deficiency preoperatively. Ten patients had partial hypopituitarism (gonadotropin and ACTH deficiency), and one paTABLE 1. Preoperative GH and IGF-I levels (mean, Tumor Size

Patients

Microadenoma Mesoadenoma Macroadenoma

17 (25%) 8 (12%) 42 (63%)

a b

SEM,

FIG. 1. Preoperative (A) and postoperative (B) spread of random GH values (E) around their mean (f) in all patients. The GH levels are plotted in the common logarithmic scale.

median, and range) in relation to the size of the tumor on pituitary imaging

Preoperative GH Levels (mU/L)a

Preoperative IGF-I levels (mU/L)b

Median

25th percentile

75th percentile

Median

25th percentile

75th percentile

23.6 37.2 91.5

10.4 23.2 39.3

75.2 72.0 148.5

613 761 930

506 548 762

841 1124 1041

P ⬍ 0.001 (Kruskal-Wallis test for k independent samples). P ⬍ 0.05 (Kruskal-Wallis test for k independent samples).

1648

P ⬍ 0.01). Table 2 shows the percentages of patients who achieved a postoperative mean serum GH level of less than 10 mU/L and less than 5 mU/L, respectively. In this study, 23 patients (34%) achieved the safe postoperative mean GH level of less than 5 mU/L or less than 2 ng/mL (median, 2.2 mU/L; 25th–75th percentiles, 1.4 – 4.7 mU/L); thirteen additional patients (19%) had a reduction of serum GH levels to less than 10 mU/L; thirty-one patients (46%), while achieving a substantial reduction from preoperative mean GH levels [median, 92.6 mU/L (41.7–150) to 20.6 (14.1– 42.4)], had a postoperative mean GH higher than 10 mU/L. After surgery, 20 of the 23 patients who had achieved a postoperative GH less than 5 mU/L obtained a serum IGF-I value within the normal range adjusted for age (median, 251; range, 196 –358 ng/mL). In addition, there were eight patients with postoperative mean GH levels more than 5 mU/L (median, 15.6 mU/L) who also obtained postoperative IGF-I levels within the normal range (median, 266 ng/mL, 210 –301). Overall, 28 patients (42%) obtained a mean postoperative IGF-I level within the age-related normal range (median, 245 ng/mL; 25th–75th percentiles, 208 –295; Fig. 3). Figure 4 shows discrepancies between the two criteria for remission of disease, mean serum GH levels, and serum IGF-I, according to how long after TSS serum IGF-I was measured. The time of assessment did not significantly affect the remission rate (Fig. 4). Overall, 87% of patients with a mean GH less than 5 mU/L had a normal IGF-I, whereas, respectively, 85% and 81% of patients with mean GH 5–10 mU/L and more than 10 mU/L had high IGF-I levels, indicating that setting the remission

FIG. 2. Relation between serum IGF-I and log of mean GH on preoperative (F) and postoperative (E) assessment. TABLE 2. Preoperative variables (mean ⫾ adjusted plasma IGF-I

No. of patients Tumor sizea Micro Meso Macro Preoperative mean GHb Preoperative IGF-Ic PRL

SEM)

b

cut-off level at 5 mU/L (2 ng/mL) has its biological correlate in the normalization of IGF-I levels. In addition, the time of measurement of IGF-I seems not to be responsible for the discrepancies between the two criteria. There was a significant correlation between postoperative mean GH and post-

FIG. 3. Remission rate according to GH (u) or IGF ( ) postoperative levels, with the total number of tumors (䡺) related to size. Micro, Intrasellar microadenoma; Meso, mesoadenoma; Macro, macroadenoma.

FIG. 4. Percentage of patients with normal (f) or abnormal (u) ageadjusted IGF-I levels according to the postoperative mean GH levels and to the time when serum IGF-I levels were measured.

according to the outcome of surgery based on mean serum GH levels and normal age-

GH (⬍5 mU/L)

GH (5–10 mU/L)

GH (⬎10 mU/L)

Normal IGF-I

23

13

31

28

59% 25% 26% 31 (10.6 –92.9) 767 (599 –992) 196 (138 –285)

18% 37% 16% 56.8 (40.2–75.1) 892 (565–1166) 779 (391–1307)

P ⬍ 0.05. P ⬍ 0.001 in predicting postoperative mean GH and IGF group. c P ⬍ 0.05 in predicting postoperative IGF-I group only. a

JCE & M • 2001 Vol. 86 • No. 4

KALTSAS ET AL.

23% 37% 54% 91.5 (39.2–132) 862 (563–1040) 321 (177–733)

82% 25% 29% 36.8 (12.4 –92.9) 748 (567–918) 197 (160 – 651)

MEAN GH LEVELS AND OUTCOME OF SURGERY IN ACROMEGALY

operative IGF-I levels (r ⫽ ⫹0.68, P ⬍ 0.001) for the group as a whole (Fig. 2). The relationship was linear when serum GH was expressed in a logarithmic scale. Histopathological studies

Sixty-six of the 67 patients were found to have a pituitary tumor. In the single patient without obvious tumor plasma GHRH levels were normal, but GH levels did not improve after surgery. An area of lymphocytic hypophysitis and a Rathke’s pouch cyst were found in another two patients. In 43% of the patients immunostaining of their tumors was positive for both GH and PRL; in 28% for GH alone; in 10% for GH, PRL, and ␣-subunit; in 9% for GH and ␣-subunit; whereas in 3% it was positive for GH, PRL, ␣-subunit, and ␤-hCG. Tumors that immunostained positive for PRL were associated with higher serum PRL levels compared with those that were negative [531 mU/L (240 – 898) vs. 189 mU/L (157–321), P ⬍ 0.01]. However, there was no difference in PRL levels between patients who had micro- and macroadenomas [218 mU/L (161– 652) vs. 321 mU/L (190 –712), P ⫽ not significant]. Factors predicting outcome

The outcome for different tumor sizes is shown in Table 2. Preoperative tumor size seemed a major determinant of the outcome of surgery, as 10 of 17 (59%) patients who had microadenomas achieved postoperative GH levels less than 5 mU/L, compared with 11 of 42 (27%) patients with macroadenomas; also, 2 of 8 patients with mesoadenomas achieved postoperative GH levels less than 5 mU/L. Because tumor size correlated with pretreatment mean GH levels, patients who were rendered safe (mean, GH ⬍5 mU/L) had significantly lower mean preoperative GH levels [median 31 mU/L (25th–75th percentiles, 10.6 –92.9) vs. 78.5 mU/L (42.1–121.37), P ⬍ 0.01] compared with patients who were not. Furthermore, the size of the tumor correlated with the degree of pituitary deficiency preoperatively, with patients with larger tumors having more significant pituitary impairment than patients with smaller tumors. Lateral extension and cavernous sinus invasion were independent factors associated with a worse surgical outcome, as four of five patients who showed no change in postoperative GH level showed such involvement. A logistic regression analysis showed that tumor size was a significant determinant of outcome (b ⫽ ⫹0.70; se(b) ⫽ 0.29; P ⫽ 0.017), whereas preoperative IGF-I levels, although weakly correlated with tumor size (r ⫽ ⫹0.36, P ⬍ 0.05 Table 1), did not predict the surgical outcome (Table 2). Preoperative PRL levels were higher in patients who failed to achieve a surgical cure [498 mU/L (187– 857) vs. 196 mU/L (136 –315), P ⬍ 0.01]; furthermore, PRL levels correlated with the presence of positive tumor immunostaining for PRL rather than the size of the tumor. A multivariate logistic regression model was generated to evaluate the overall predictive value of preoperative mean GH, PRL, and tumor size on surgical outcome. Mean preoperative serum GH (b ⫽ ⫹2.16; se(b) ⫽ ⫹0.77; P ⫽ 0.005) and PRL (b ⫽ ⫹2.13; se(b) ⫽ ⫹0.83; P ⫽ 0.010) were significant independent predictors of the outcome, whereas tumor size was no longer significant, mainly because of its inter-

1649

action with the variable of serum GH. Interestingly, the effect of PRL on outcome was independent of tumor immunostaining, which was not a significant predictor. The incidence of noncurative surgery was not statistically related to study years. Mean GH measurement in normal subjects vs. patients with acromegaly

Spontaneous GH secretion in normal subjects (based on the five-point day curve) was characterized by discrete pulses separated by at least one nadir GH concentration below the limit of detection of the assay (⬍0.5 mU/L). By contrast, in all untreated patients with acromegaly the nadir GH was above the limit of sensitivity of the assay. Eight of the 23 patients with postoperative mean GH levels less than 5 mU/L showed at least one undetectable GH value; all these eight patients also showed normal IGF-I levels 5–180 days after surgery (Fig. 1B). Mean vs. single random GH measurement

There was a significant correlation between both pre- and postoperative mean GH and each single random GH, minimum GH, and maximum GH values (postoperative mean GH against maximum and minimum GH values r ⫽ ⫹0.97 and r ⫽ ⫹0.96, P ⬍ 0.01 respectively, Fig. 5). However, there were 9 of 23 patients (39%) who obtained a mean postoperative GH value less than 5 mU/L who had at least one single random GH value more than 5 mU/L; in addition, there were also 8 of 44 patients (18%) with a mean postoperative GH value more than 5 mU/L who had at least one single random GH value less than 5 mU/L (Table 3). Positive and negative predictive values for at least one of the random GH levels being above or below 5 mU/L during a day curve were, respectively, 80% and 63% when disease was assessed using the mean GH less than 5 mU/L, but 68% and 61% when disease was assessed by normalization of IGF-I levels. The sensitivity and specificity of mean GH against normalization of IGF-I were 92% and 71%, respectively, whereas positive and negative predictive values were 82% and 87%, respectively. Analysis of fluctuation of postoperative random GH

FIG. 5. Relation between the postoperative mean GH levels and the five GH values of the day curve on which the mean was obtained.

1650

KALTSAS ET AL.

TABLE 3. Patients with discrepant mean and random postoperative GH levels above or below 5 mU/L Mean postoperative GH (⬍5 mU/L)

Mean postoperative GH (⬎5 mU/L)

Patient no

Mean GH

Single GH

Patient no

Mean GH

Single GH

1 2 3 4 5 6 7 8 9

2.3 3.2 3.8 4.6 4.7 4.9 4.9 5.0 5.0

5.4 5.9 8.7 8.2 6.3 6.8 13.7 11.4 6.5

10 11 12 13 14 15 16 17

5.7 6.1 6.3 6.4 6.6 8 8.8 15

3.8 4.3 3.7 4.1 4.9 4.9 3.6 2.3

values around their mean showed that the percentage distances of the peak (55 ⫾ 6%) and the nadir (37 ⫾ 2%) from the mean GH value were significantly higher than they were preoperatively (P ⬍ 0.05; Fig. 1B). Furthermore, postoperatively the percentage variation from the mean was 1.5- to 2-fold higher in the group of patients in remission (from ⫺46 ⫾ 5% to 85 ⫾ 11%) than it was before surgery, or in patients with persistence of disease after TSS (from ⫺32 ⫾ 8% to 39 ⫾ 5%, respectively; P ⬍ 0.01). Discussion

The biochemical confirmation of a diagnosis of acromegaly is rarely difficult, and it is generally agreed that it is best established on the basis of inadequate suppression of serum GH during a standard OGTT and an elevated IGF-I (11); however, there is less concordance in the reported assessment of the disease activity that varies from random single estimates of serum GH, mean GH during a day profile, mean or nadir values during a standard OGTT, and, more recently, serum IGF-I (10, 20). TSS and selective adenomectomy offer the best chance for immediate satisfactory lowering of GH levels (19, 34, 44) because it may well achieve posttreatment GH levels less than 5 mU/L (2 ng/mL), defined as the mean serum GH of a series of GH measurements, which are associated with a mortality not significantly different from the background population (5). Using this therapeutic aim, 34% of our patients, the majority of which had macroadenomas, achieved a safe mean GH level after TSS, and such patients will not require further adjunctive treatment. Whereas our study shows a very high correlation between single random GH levels and mean GH levels in patients with acromegaly, it also demonstrates the wide variation of single random GH measurement, and identifies potential mistakes that can arise when only single random GH measurements are used in evaluating patients after surgery. Nine of the 23 patients, who were rendered safe following TSS according to the above criterion, would have been erroneously diagnosed as partially treated if based on a single GH measurement alone and may have received further unnecessary treatment with radiotherapy and/or medication; in addition, 8 patients who did not achieve a postoperative mean GH level of less than 5 mU/L would also have been erroneously considered as safe if based on a single GH measurement alone and would have escaped further

JCE & M • 2001 Vol. 86 • No. 4

treatment; such patients would be exposed to the risk of long-term complications and mortality associated with mean GH levels more than 5 mU/L (5). Based on the good correlation that exists between a single GH and mean GH values (14, 15, 20, 24, 44), several studies have previously questioned the necessity of mean GH values and have suggested reliance on single GH measurements in assessing disease activity in acromegaly (13, 15, 44). This was further reinforced when patients with acromegaly who were followed up with random single GH measurements were reported to have normal morbidity and mortality rates (18). However, secretion of GH is pulsatile, and only the mean of a series of GH values throughout the day has been shown to have highest sensitivity and specificity (24, 45); it has been reported that, based on single GH measurements alone, only 8 of 10 patients with active disease (high IGF-I) will be identified (24). Furthermore, in their long-term follow-up study based on a single random GH measurement, Abosch et al. (18) reported normal morbidity and mortality even in patients with GH values between 5 and 10 mU/L, a finding that differs from several other studies that have shown that only GH levels either less than 5 mU/L or less than 6 mU/L are associated with normal IGF-I secretion and mortality rates (5–7). Furthermore, in the study of Bates et al. (5), patients with mean GH levels more than 5 mU/L but less than 10 mU/L still showed increased morbidity and mortality rates. The contradiction in the study of Abosch et al. (18) was originally attributed to the fact that the majority of their patients harbored microadenomas, as assessed by older and less sensitive imaging techniques, and that the GH measurements were performed in a number of different laboratories and, thus, may not reflect current experience with sensitive imaging and assays (10). However, it is possible that the patients with single random GH levels less than 10 mU/L and normal mortality rates in long-term follow-up described by Abosch et al. (18) may well have had a mean GH value less than 5 mU/L; such a mean GH value would be compatible with normal survival irrespective of the fluctuation of random singleton GH values, as demonstrated in the patients who obtained a postoperative mean GH value less than 5 mU/L in our study (5). We have shown here that this fluctuation in GH values is, indeed, greater in those patients with safe mean GH levels. Singleton random GH measurements also do not provide information on GH secretory dynamics. Several authors consider the post-OGTT nadir GH value as the best indicator of disease activity in acromegaly, because it also evaluates GH secretory dynamics (10, 13, 14). Although post-OGTT nadir GH values were not routinely available in our study, there is a good correlation of both mean GH and post-OGTT nadir GH value with the IGF-I level (26). However, a potential limitation on relying solely on post-OGTT nadir GH levels is that the majority of information on morbidity and mortality rates in acromegaly has been obtained from studies that used either mean GH (5) or single random GH measurements to assess disease activity (6, 18); there is relatively little information based on post-OGTT nadir GH levels (7). A normal postoperative OGTT may not be predictive of long-term cure, in particular because patients with a “normal” OGTT and subsequent clinical recurrences have been described (46 – 48).

MEAN GH LEVELS AND OUTCOME OF SURGERY IN ACROMEGALY

Indeed, recent data showed that nadir GH levels after glucose suppression measured by sensitive assays in patients with persistent acromegaly are much lower than previously recognized (27). In addition, the mean GH level may also be able to provide information on GH dynamics, because only patients with acromegaly who were cured obtained occasionally undetectable (⬍0.5 mU/L) GH levels and, thus, showed restored GH secretory dynamics similar to that seen in normal subjects. Previous studies (19, 24, 45, 46) have demonstrated the pulsatile secretion of GH in patients with acromegaly. This is of particular interest because the increase in integrated GH concentration is accounted for mainly by an increase in the nonpulsatile fraction that most strongly correlates with serum IGF-I concentration (24, 45); twenty-four-hour sampling studies showed that in patients with acromegaly GH levels fail to fall to undetectable levels and the fluctuation of GH levels is usually around an increased baseline (24). Therefore, it is unlikely that a single GH measurement pretreatment will misclassify patients unless basal secretion is low. However, following successful treatment, patients with acromegaly may restore normal GH dynamics with a reduction of basal GH secretion, and a single GH may not reflect disease activity as shown from the results of our study. It seems reasonable to suggest that when assessing patients with active acromegaly a single random GH may suffice to reflect overall GH secretion, whereas when defining response to treatment and cure in patients with a substantial reduction of elevated GH levels, then a mean GH value has the appropriate sensitivity and specificity to reduce the number of false negative and/or false positive observations. IGF-I, which has a long half-life and is GH dependent, has been proposed as a useful marker of overall GH status and, hence, disease activity. Although initial studies showed that GH was not correlated with IGF-I, recent evidence (19, 24, 26, 46) has suggested that IGF-I measurements could be used to monitor disease activity overall and predict long-term outcome in acromegaly (7, 11). Using normalization of IGF-I levels following surgery as a definition of cure, 42% of our patients were cured as opposed to 34% when based on a mean GH curve less than 5 mU/L, a figure closer to most cure rates currently quoted (13, 14, 49). However, there are well documented discrepancies between mean GH and IGF-I levels in individual patients (15, 26, 32), even in those with postoperative GH levels less than 0.5 mU/L (24, 50). Until the significance of these findings with respect to long-term outcome is known, we suggest that estimates of serum GH should continue to be used in the follow-up of these patients in addition to IGF-I. Although there is a much more extensive body of data relating long-term prognosis in acromegaly to GH levels rather than IGF-I, current data suggest that both normalization of IGF-I levels and postoperative mean serum GH levels less than 5 mU/L are associated with normal morbidity and mortality rates; if both of these targets are achieved, then the evidence suggests that such patients will not require any immediate further treatment (7, 10, 20). The response to surgery was seen to depend principally on tumor size, serum GH, and PRL levels; as these are in turn correlated, surgical outcome can be related to serum GH and PRL levels alone. This is also true in relation to the presence of

1651

pituitary deficiencies. By contrast, we have found that basal preoperative serum IGF-I, which correlated poorly with tumor size, was unrelated to surgical outcome. Total IGF-I has a half-life a little less than 24 h, and we reasoned that measurement at a minimum of 1 week postoperatively should reflect surgical efficacy. Indeed, we found no difference in apparent cure rates assessed 1–3 weeks and more than 4 weeks after surgery, despite suggestion that such early assessment may be inaccurate (11). In a previous study, it was reported that early normalization of IGF-I was generally associated with persistent cure, although a gradual fall with time could also occur (30). Until data are available on longitudinal changes in time following surgery, we suggest that early assessment of postoperative IGF-I is prognostically useful. In summary, we suggest that the mean GH day curve currently remains a valuable way of assessing disease activity in acromegaly following surgery because it eliminates the limitations of random GH measurements, may reflect GH secretory dynamics, predicts normal IGF-I levels in the majority of cases, and provides information regarding longterm morbidity and mortality rates. Although random single GH measurements show a high correlation with mean GH levels, individual variations may lead to misdiagnosis of the true level of GH. Estimation of both mean GH and IGF-I levels may be necessary to follow-up patients with discordant results to optimize management. Preoperatively, the size of the tumor, and degree of anterior pituitary deficiency, GH, and PRL levels, but not IGF-I levels, can be used to help predict surgical outcome and, thus, identify in advance patients who will be likely to require further treatment to achieve safe GH levels. In practice, any postoperative mean GH more than 5 mU/L with an IGF-I level above the age-reference range clearly requires further treatment. When biochemical data are not congruent at an early assessment, we suggest reevaluation at 6 months, bearing in mind those conditions that may result in altered IGF-I levels. In our opinion, a mean serum GH more than 5 with IGF-I levels between the median and the upper limit of normal age reference range should be considered for medical treatment, whereas a mean GH more than 10 mU/L should be treated aggressively. In the presence of a mean GH less than 5 mU/L with elevated IGF-I levels, the decision is mainly clinical based on age and cardiovascular, metabolic, and tumoral comorbidity. References 1. Alexander L, Appleton D, Hall R, Ross WM, Wilkinson R. 1980 Epidemiology of acromegaly in the Newcastle region. Clin Endocrinol (Oxf). 12:71–79. 2. Ritchie CM, Atkinson AB, Kennedy AL, et al. 1990. Ascertainment and natural history of treated acromegaly in Northern Ireland. Ulster Med J. 59:55– 62. 3. Etxabe J, Gaztambide S, Latorre P, Vaquez JA. 1993 Acromegaly: an epidemiological study. J Endocrinol Invest. 16:181–187. 4. Bengtsson BA, Eden S, Ernest I, Oden A, Sjogren B. 1988 Epidemiology and long-term-survival in acromegaly. A study of 166 cases diagnosed between 1955 and 1984. Acta Med Scand. 223:327–335. 5. Bates AS, Van’t Hoff W, Jones JM, Clayton RN. 1993 An audit of outcome of treatment in acromegaly. Q J Med. 86:293–299. 6. Rajasoorya C, Holdaway IM, Wrightson P, Scott DJ, Ibbertson HK. 1994 Determinants of clinical outcome and survival in acromegaly. Clin Endocrinol (Oxf). 41:95–102. 7. Swearingen B, Barker FG, Katznelson L, et al. 1998. Long-term mortality after

1652

8. 9. 10. 11. 12. 13. 14. 15.

16. 17.

18. 19. 20. 21. 22. 23. 24. 25. 26.

27. 28. 29.

KALTSAS ET AL.

transsphenoidal surgery and adjunctive therapy for acromegaly [see comments]. J Clin Endocrinol Metab. 83:3419 –3426. Melmed S. 1990 Acromegaly. N Engl J Med. 322:966 –977. Sheppard MC. 1994 Aims of treatment and definition of cure. In: Wass JA, ed. Treating acromegaly. Bristol: Bioscientifica Ltd.; 103–108. Melmed S, Jackson I, Kleinberg D, Klibanski A. 1998 Current treatment guidelines for acromegaly. J Clin Endocrinol Metab. 83:2646 –2652. Giustina A, Barkan A, Casanueva FF, et al. 2000. Criteria for cure of acromegaly: a consensus statement. J Clin Endocrinol Metab. 85:526 –529. Trainer PJ, Drake WM, Katznelson L, et al. 2000. Treatment of acromegaly with the growth hormone-receptor antagonist pegvisomant [see comments]. N Engl J Med. 342:1171–1177. Osman IA, James RA, Chatterjee S, Mathias D, Kendall-Taylor P. 1994 Factors determining the long-term outcome of surgery for acromegaly. Q J Med. 87:617– 623. Jenkins D, O’Brien I, Johnson A, Shakespear R, Sheppard MC, Stewart PM. 1995 The Birmingham pituitary database: auditing the outcome of the treatment of acromegaly. Clin Endocrinol (Oxf). 43:517–522. Parfitt VJ, Flanagan D, Wood P, Leatherdale BA. 1998 Outpatient assessment of residual growth hormone secretion in treated acromegaly with overnight urinary growth hormone excretion, random serum growth hormone and insulin like growth factor-1. Clin Endocrinol (Oxf). 49:647– 652. Ross DA, Wilson CB. 1988 Results of transsphenoidal microsurgery for growth hormone-secreting pituitary adenoma in a series of 214 patients. J Neurosurg. 68:854 – 867. Tindall GT, Oyesiku NM, Watts NB, Clark RV, Christy JH, Adams DA. 1993 Transsphenoidal adenomectomy for growth hormone-secreting pituitary adenomas in acromegaly: outcome analysis and determinants of failure. J Neurosurg. 78:205–215. Abosch A, Tyrrell JB, Lamborn KR, et al. 1998 Transsphenoidal microsurgery for growth hormone-secreting pituitary adenomas: initial outcome and longterm results. J Clin Endocrinol Metab. 83:3411–3418. Barkan A, Beitins IZ, Kelch RP. 1988 Plasma insulin-like growth factor-I/ somatomedin-C in acromegaly: correlation with the degree of growth hormone hypersecretion. J Clin Endocrinol Metab. 67:69 –73. Bates AS, Evans DJ, Jones P, Clayton RN. 1995 Assessment of GH status in acromegaly using serum growth hormone, serum insulin-like growth factor-I and urinary growth hormone excretion. Clin Endocrinol (Oxf). 42:417– 423. Jaffe CA, Barkan AL. 1994 Acromegaly. Recognition and treatment. Drugs. 47:425– 445. Jasper H, Chervin A, Vitale M, Mella A, Ropelato G, Basso A. 1994 Biochemical markers of acromegalic activity. Medicina (B Aires). 54:25–30. Lamberts SW, del Pozo E. 1988 Somatostatin analog treatment of acromegaly: new aspects. Horm Res. 29:115–117. Ho KY, Weissberger AJ. 1994 Characterization of 24-hour growth hormone secretion in acromegaly: implications for diagnosis and therapy. Clin Endocrinol (Oxf). 41:75– 83. Stonesifer LD, Jordan RM, Kohler PO. 1981 Somatomedin C in treated acromegaly: poor correlation with growth hormone and clinical response. J Clin Endocrinol Metab. 53:931–934. Dobrashian RD, O’Halloran DJ, Hunt A, Beardwell CG, Shalet SM. 1993 Relationships between insulin-like growth hormone factor-1 levels and growth hormone concentrations during diurnal profiles and following oral glucose in acromegaly. Clin Endocrinol (Oxf). 38:589 –593. Freda PU, Post KD, Powell JS, Wardlaw SL. 1998 Evaluation of disease status with sensitive measures of growth hormone secretion in 60 postoperative patients with acromegaly. J Clin Endocrinol Metab. 83:3808 –3816. Freda PU, Wardlaw SL, Post KD. 1998 Long-term endocrinological follow-up evaluation in 115 patients who underwent transsphenoidal surgery for acromegaly [see comments]. J Neurosurg. 89:353–358. Levitt NS, Ratanjee BD, Abrahamson MJ. 1995 Do “so-called” normal growth hormone concentrations (2–5 micrograms/l) indicate cure in acromegaly? Horm Metab Res. 27:185–188.

JCE & M • 2001 Vol. 86 • No. 4

30. Yamada S, Aiba T, Takada K, et al. 1996. Retrospective analysis of long-term surgical results in acromegaly: preoperative and postoperative factors predicting outcome. Clin Endocrinol (Oxf). 45:291–298. 31. Holly JM, Cotterill AM, Jemmott RC, et al. 1991. Inter-relations between growth hormone, insulin, insulin like growth factor I (IGF-I), IGF binding protein (IGFBP-I) and sex hormone binding globulin in acromegaly. Clin Endocrinol (Oxf). 34:275–280. 32. Sheaves R. 1993 Interpretation of IGF-1 levels in the assessment of acromegaly. The Royal Society of Medicine, Current Medical Literature Series. Growth Hormone and Growth Factors. 8:59 – 61. 33. Hoffman DM, O’Sullivan AJ, Baxter RC, Ho KK. 1994 Diagnosis of growth hormone deficiency in adults. Lancet. 343:1064 –1068. 34. Sheaves R, Jenkins P, Blackburn P, et al. 1996. Outcome of transsphenoidal surgery for acromegaly using strict criteria for surgical cure. Clin Endocrinol (Oxf). 45:407– 413. 35. Trainer PJ, Besser GM. 1995 St. Bartholomew’s Hospital. The Bart’s endocrine protocols. Edinburgh: Churchill Livingstone. 36. Stewart PM, Smith S, Seth J, Stewart SE, Cole D, Edwards CR. 1989 Normal growth hormone response to the 75 g oral glucose tolerance test measured by immunoradiometric assay. Ann Clin Biochem. 26:205–206. 37. Hattori N, Shimatsu A, Kato Y, et al. 1990. Growth hormone responses to oral glucose loading measured by highly sensitive enzyme immunoassay in normal subjects and patients with glucose intolerance and acromegaly. J Clin Endocrinol Metab. 70:771–776. 38. Chapman IM, Hartman ML, Straume M, Johnson ML, Veldhuis JD, Thorner MO. 1994 Enhanced sensitivity growth hormone (GH) chemiluminescence assay reveals lower postglucose nadir GH concentrations in men than women. J Clin Endocrinol Metab. 78:1312–1319. 39. Morrell DJ, Dadi H, More J, et al. 1989. A monoclonal antibody to human insulin-like growth factor-I: characterization, use in radioimmunoassay and effect on the biological activities of the growth factor. J Mol Endocrinol. 2:201–206. 40. Ross RJM, Grossman AB, Bouloux PM, Rees LH, Doniach I, Besser GM. 1985 The relationship between serum prolactin and immunocytochemical staining for prolactin in patients with pituitary macroadenoma. Clin Endocrinol (Oxf). 23:227–235. 41. Lidhar K, Korbonits M, Jordan S, et al. 1999. Low expression of the cell cycle inhibitor p27kipl in normal corticotroph cells, corticotroph tumors, and malignant pituitary tumors. J Clin Endocrinol Metab. 84:3823–3830. 42. Siegel S. 1956 Nonparametric statistics for the behavioral sciences. New York: McGraw Hill. 43. Hosmer DW, Lemeshow S. 1989 Applied logistic regression. New York: Wiley. 44. Barkan A, Halasz I, Doronfeld KS, et al. 1997 Pituitary irradiation is ineffective in normalizing plasma insulin-like growth factor I in patients with acromegaly. J Clin Endocrinol Metab. 82:3187–3191. 45. Semer M, Faria AC, Nery M, et al. 1995 Growth hormone pulsatility in active and cured acromegaly subjects. J Clin Endocrinol Metab. 80:3767–3770. 46. Hartman ML, Veldhuis JD, Vance ML, Faria AC, Furlanetto RW, Thorner MO. 1990 Somatotropin pulse frequency and basal concentrations are increased in acromegaly and are reduced by successful therapy. J Clin Endocrinol Metab. 70:1375–1384. 47. Arafah BM, Rosenzweig JL, Fenstermaker R, Salazar R, McBride CE, Selman W. 1987 Value of growth hormone dynamics and somatomedin C (insulin-like growth factor I) levels in predicting the long-term benefit after transsphenoidal surgery for acromegaly. J Lab Clin Med. 109:346 –354. 48. Schuster LD, Bantle JP, Oppenheimer JH, Seljeskog EL. 1981 Acromegaly: reassessment of the long-term therapeutic effectiveness of transsphenoidal pituitary surgery. Ann Intern Med. 95:172–174. 49. Fahlbusch R, Honegger J, Buchfelder M. 1997 Evidence supporting surgery as treatment of choice for acromegaly. J Endocrinol. 155(Suppl 1):S53–S55. 50. Lindholm J, Giwercman B, Giwercman A, Astrup J, Bjerre P, Skakkebaek NE. 1987 Investigation of the criteria for assessing the outcome of treatment in acromegaly. Clin Endocrinol (Oxf). 27:553–562.