Mortality Associated with Psychiatric Disorders in ... - Semantic Scholar

2 downloads 45 Views 72KB Size Report
Alexander H. Leighton7. This paper applies new statistical ...... Leighton DC, Harding JS, Macklin DB, et al. The character of danger: the Stirling County Study of ...
American Journal of Epidemiology Copyright © 2001 by the Johns Hopkins University Bloomberg School of Public Health All rights reserved

Vol. 154, No. 7 Printed in U.S.A.

Multiple Informants: Mortality Associated with Psychiatric Disorders in the Stirling County Study

Nicholas J. Horton,1,2 Nan M. Laird,3 Jane M. Murphy,4–6 Richard R. Monson,6 Arthur M. Sobol,4 and Alexander H. Leighton7 This paper applies new statistical procedures for analyzing multiple-source information about the relation of psychiatric diagnoses to mortality. The data come from the Stirling County Study, a longitudinal community investigation of adults, that collected multiple-source reports (self-report and physician-report) about psychiatric disorders. These reports are used as predictors of mortality risk over a 16-year follow-up period (1952–1968). Despite extensive efforts, one or both of these reports were sometimes missing. Missingness of self-report was related to demographic characteristics as well as to physician-reports of psychiatric diagnosis. The statistical procedures used here draw together into a single frame of reference both informant reports for the initial Stirling survey and relate these to mortality risk using weighted generalized estimating equation regression models for time to event data. This unified method has two advantages over traditional approaches: 1) the relative predictiveness of each informant can be assessed and 2) all subjects contribute to the analysis. The methods are applicable to other areas of epidemiology where multiple informant reports are used. The results for self-reports and physician-reports of disorders were comparable: Psychiatric diagnosis was associated with higher mortality, particularly among younger subjects. Am J Epidemiol 2001;154:649–56. estimating equation; missing data; regression

Many psychiatric epidemiologic studies of adults in the community utilize a single source of information, typically by interviewing a randomly selected member of a household about his or her psychiatric symptoms (1–7). An exception is the Stirling County Study, a 40-year study of a general population (8 –10) that has made use of information collected from two informants (or sources). One is the typical household survey, but the other is an interview with general physicians about the same persons.

These multiple reports of psychiatric disorders are used in conjunction with information on follow-up to understand the association between psychiatric disorders and mortality risk in a community population. In recent years, multiple-informant reports have been used in epidemiologic studies of child psychiatric disorders (11–17). The rationale is that each source of information— parent, teacher, child—might have a unique perspective and that a complete picture would not be possible unless each was taken into account. By collecting reports from several sources, one expects that psychopathology can be more accurately and reliably determined (18). An inherent feature of multiple-informant data is that one anticipates discordant reports. If there is no discordance, the additional reports provide no new information. There have been a number of papers, primarily focused on studies of children, that review methods to integrate reports from multiple informants (19 –21). A commonly used approach involves a separate analysis for each of the informants (22–24). This approach has some limitations, including how to compare results from the separate models. Horton et al. (17) describe methods for incorporating multiple-informant reports as predictor variables (covariates) in regression modeling that overcome the limitations of common approaches. In earlier reports of the Stirling County Study, psychiatric cases were identified by having psychiatrists review the materials assembled from both sources (25). Later work focusing on the common theme of the relations between

Received for publication September 8, 2000, and accepted for publication March 12, 2001. Abbreviations: DPAX, self-reported data on depression and anxiety; GEE, generalized estimating equation; GP-DATA, general physician data. 1 Department of Epidemiology and Biostatistics, Boston University School of Public Health, Boston, MA. 2 Department of Medicine, Boston University School of Medicine, Boston, MA. 3 Department of Biostatistics, Harvard School of Public Health, Boston, MA. 4 Department of Psychiatry, Massachusetts General Hospital, Boston, MA. 5 Department of Psychiatry, Harvard Medical School, Boston, MA. 6 Department of Epidemiology, Harvard School of Public Health, Boston, MA. 7 Departments of Psychiatry and of Community Health and Epidemiology, Faculty of Medicine, Dalhousie University, Halifax, Nova Scotia, Canada. Correspondence to Dr. Nicholas Horton, Department of Epidemiology and Biostatistics, Boston University School of Public Health, 715 Albany St. T3E, Boston, MA 02118 (e-mail: [email protected]).

649

650

Horton et al.

psychiatric disorders and mortality analyzed the informants separately. One paper was concerned with the relations between self-reported disorders and mortality (26). When such disorders were aggregated using the self-report information, the standardized mortality ratio was 1.5. Another paper addressed the association of physician-reported disorders and mortality (27). When disorders identified by physicians were similarly aggregated, the standardized mortality ratio was 1.6. Both investigations indicated that psychiatric disorder was significantly associated with mortality and that subjects who died before reaching the age of 50 years were especially likely to have had a psychiatric disorder. A practical difficulty in analyzing multiple-source reports is that there is often a substantial amount of missing data (21, 28). Multiple-informant reports are commonly missing since, by definition, they are collected from sources other than the primary subject of the study. Missingness can induce bias as well as loss of inferential efficiency (29, 30). In psychiatric epidemiology, results can be seriously biased if subjects who are missing information suffer more psychopathology than those for whom it was possible to gather data. For example, the National Comorbidity Survey (6) showed that nonrespondents have significantly more psychopathology than respondents have, which emphasizes the importance of adjustment for differential nonresponse. In the Stirling County Study, some subjects were missing one or both of their reports of psychiatric problems. Our hypothesis is that the absence of physician-reports is not related to whether the subject had a psychiatric disorder or not, and that missing reports were due to clerical omissions. On the other hand, we hypothesize that the absence of interview (self-report) data can be related to a subject’s psychiatric status. A person with a psychiatric disorder may be more likely to refuse the interview or to be repeatedly unavailable. Based on the availability of advanced statistical methods, this is the first report from the Stirling County Study in which the two sources are drawn together in such a way that the measured effects of each can be compared. The question of interest is to quantify the effect of psychiatric morbidity on subsequent mortality. In this paper, we apply new statistical procedures for drawing together into a single frame of reference the information from the subjects and physicians of the Stirling County Study as gathered in the first survey that was conducted in 1952, as well as information about mortality over the following 16 years. We describe unified regression models that jointly model the associations between the reports of psychiatric disorders and subsequent death. These models allow separate regressions to be analyzed together, tested for differences, and simplified if appropriate to determine a final overall regression. Partially observed multiple-source reports may be incorporated into these regression models to account for differential missingness.

of “Stirling” is used to protect identity (31). The area has undergone many of the influences of urbanism and has experienced the major social trends exhibited throughout North America (10, 32). The study consists of repeated cross-sectional surveys, as well as cohort follow-up investigations in which subjects from earlier samples were reassessed or certified for death at each field phase when a new sample was drawn. For the 1952 survey, 1,098 subjects were selected by probability sampling. Among these, 1,003 provided a personal interview (91 percent completeness), and 1,029 were known to at least one physician who was able to provide a report (94 percent completeness). Although 953 subjects had both types of information available, 1,079 had at least one report available (50 with only a self-report and 76 with only a physician-report). These 1,079 subjects constitute the sample of interest in the paper. Measures

The personal interview data were processed by a computer algorithm for self-reported data on depression and anxiety (DPAX), “DP” standing for depression and “AX” for anxiety (33). The algorithm has steps for determining that the syndromes of depression and/or anxiety are reasonably complete, that they persisted over a minimal duration of 1 month, and that they were associated with impairment in everyday functioning. Ten physicians contributed information for this study, each having been interviewed by a psychiatrist who asked about syndrome completeness, duration, and impairment as components of diagnosis. While the self-report interviews focused on disorders involving depression and anxiety, the physician interviews covered the full range of psychiatric disorders. The separate physicians are not distinguished in this study, and their collective information is described as general physician data (GPDATA). In this paper, our focus is on whether the subject was identified as having a psychiatric disorder rather than on what type of psychiatric diagnosis pertained. In addition to using the case assessments from the two sources, we used the demographic variables gender and age. Age in 1952 was divided into three categories: ⬍50, 50 – 69, and ⱖ70 years, because overall mortality is associated with age. Outcome

June 1968 was taken as the common closing date for the search of death certificates. For the large majority of subjects who died in the province where “Stirling County” is located, death certificates were found. For those who died elsewhere, information was drawn from relatives, neighbors, physicians, and family bibles (26, 27). Complete information on vital status at this time was achieved. Approximately one quarter (n ⫽ 270) of the sample died during this 16-year follow-up period.

MATERIALS AND METHODS

Statistical methods

Study sample

We use survival regression models as described by Laird and Olivier (34) to assess the magnitude and significance of the relation between independent variables and mortality.

The site of the Stirling County Study is a county of 20,000 residents in Atlantic Canada, and the fictitious name

Am J Epidemiol Vol. 154, No. 7, 2001

Multiple Informants of Psychiatric Disorders

These methods specify a piecewise exponential survival distribution and generate maximum likelihood estimates and associated tests. They approximate proportional hazards regression methods but have the advantage that we can incorporate multiple-informant models, using generalized estimating equations (35, 36) (GEEs) for parameter estimation. More formally, we assume that time in the study can be partitioned into J mutually exclusive, exhaustive intervals ⍀1, ..., ⍀J, with a constant hazard function within each interval. We observe (Yij, Tij) for i ⫽ 1, ..., n and j ⫽ 1, ..., J, where Yij denotes whether the ith subject died during the jth period, and Tij denotes the time at risk for the ith subject during the jth period. Here Yij ⫽ 1 for some i and j implies that Ti,k ⫽ 0 for k ⫽ j ⫹ 1, ..., J. We divided the follow-up period into J ⫽ 4 intervals, each having a duration of 4 years, and allowed the constant rate parameter to vary for each interval. The last interval (interval 4) was used as the reference group. Other predictors were assumed to be constant over time (i.e., no interaction with interval). Laird and Olivier show that treating each Yij as a Poisson random variable: Yij ⬃ Poisson(Tij aij), where the model for the rate is specified by

冢冣

ai1 ai2 ·· ⫽ ai ⫽ exp共Xⴕi␤兲, · aiJ

yields the correct piecewise exponential likelihood. Here Xi is a vector of covariates for the ith subject and ␤ denotes a set of regression parameters for the rate. We denote the expected number of events:

mi ⫽ E关Yi兩Ti , Xi , ␤兴 ⫽ Ti ai ⫽ Ti exp共Xⴕi␤兲, and we model

log共mi兩Ti , Xi , ␤兲 ⫽ log共Ti兲 ⫹ Xⴕi␤, where log(Tj) is a known offset. Maximum likelihood estimation of the parameters of log(mi) is straightforward to calculate in existing Poisson regression software packages (e.g., PROC GENMOD in SAS) (37) for a single informant and fully observed predictors. We extend the model, by considering whether use of a particular informant will yield different results for the overall regression model, using techniques proposed independently by Pepe et al. (38) and Horton et al. (17). Let X1 and X2 denote two multiple-informant reports where, for our example, X1 is the GP-DATA (physician-report) and X2 is the DPAX (self-report). We denote by Z a vector of other covariates and by (Y, T) indicators of survival status and follow-up time. We model the hazard for mortality as a function of the reports of psychiatric disorders and other covariates. Fitting separate regression models is equivalent to specifying

log共mi兩Ti , X1i , Zi , ␤兲 ⫽ log共Ti兲 ⫹ ␤0 ⫹ ␤1X1i ⫹ ␤2Zi (1) Am J Epidemiol Vol. 154, No. 7, 2001

651

and

log共mi兩Ti , X2i , Zi , ␣, ␤兲 ⫽ log共Ti兲 ⫹ 共␣0 ⫹ ␤0兲 ⫹ 共␣1 ⫹ ␤1兲 X2i ⫹ 共␣2 ⫹ ␤2兲Zi (2) where other variables and interactions may be included. A limitation of this approach has been the lack of a method for deciding if regression coefficients are different in the two models and how to combine results if they are not. Pepe et al. (38) and Horton et al. (17) showed how the GEE regression model can be used with existing general purpose statistical software (39), including PROC GENMOD in SAS, to fit models that allow the same parameters to appear in both equations. For example, it might be assumed that there is no effect of informant on the associations between the covariates Zi and the outcome. Under this assumption, ␣2 would be presumed to be 0. In this case, the regression model is simplified:

log共mi兩Ti , X1i , Zi , ␤兲 ⫽ log共Ti兲 ⫹ ␤0 ⫹ ␤1X1i ⫹ ␤2Zi

(3)

log共mi兩Ti , X2i , Zi , ␣, ␤兲 ⫽ log共Ti兲 ⫹ 共␣0 ⫹ ␤0兲 ⫹ 共␣1 ⫹ ␤1兲 X2i ⫹ ␤2Zi . The ␣1 parameter assesses how the association between diagnosis and outcome varies by informant. If this parameter is significantly different from 0, this would imply that there is a different mortality risk associated with one of the informant reports. This might be due, in our example, to the physician-report’s being associated with a more serious type of disorder. The ␣0 parameter in this model denotes the difference in baseline log rate using the second informant. This may differ from 0 if the informant effects are estimated from different subsets of the data, and if missingness is related to the outcome. While the same outcome is used in both regression equations, the distribution of Y in the two regression models is different as long as the predictors are different. Including separate parameters for models 1 and 2 implies that the use of different sources leads to different inferences about the model, while assuming that ␣ ⫽ 0 suggests that the data from different informants lead to the same inferences concerning the association of diagnosis and other predictors of mortality. A limitation of the use of separate regression models is that, if some of the informant reports X1 and X2 are missing, then models 1 and 2 will be fit using different subsets of the sample. The GEE permits a subject to contribute to one equation and not to the other, but use of these “available case” methods may be inefficient, as well as biased if missingness is not completely at random (30). Xie and Paik (40) proposed a simple approach to handling missing covariates in GEE models when the probability of missingness depends on the outcomes and/or observed covariates. This method assumes that the missingness mechanism is missing at random (29): Miss-

652

Horton et al.

ingness of the unobserved covariate does not depend on the value of the unobserved covariate. In certain settings, the model reduces to a weighted GEE (41, 42). We define the missingness indicators

R1i ⫽

再 01

if X1i is observed if X1i is missing

R2i ⫽

再 01

if X2i is observed if X2i is missing .

and

Under the missing at random assumption, we have that

p1 ⫽ Pr共R1 ⫽ 1兩Y, T, X1 , X2 , Z兲 ⫽ Pr共R1 ⫽ 1兩Y, T, X2 , Z, ␥1兲,

(4)

and

p2 ⫽ Pr共R2 ⫽ 1兩Y, T, X1 , X2 , Z兲 ⫽ Pr共R2 ⫽ 1兩Y, T, X1 , Z, ␥2兲.

(5)

Following the approach of Xie and Paik, we use logistic regression models, governed by ␥, for fitting models 4 and 5 and estimating p1 and p2. The inverses of pˆ1 and pˆ2 are used as weights in the GEE analysis; that is, pˆ⫺1 weights the equation with X1 as a covariate and pˆ⫺1 for the equation 2 with X2. We assume a working independence correlation structure, which reduces to standard Poisson regression with estimated weights. Subjects with two informants contribute two observations; those missing one contribute one observation. We use the bootstrap (43) to estimate the standard errors of ␣ and ␤. We create a data set of size 1,079 by randomly sampling persons with replacement from the original data set and refit the same model using this new sample. This procedure is repeated 1,000 times, with the estimated standard error calculated from the sample variance from these 1,000 data sets. With time to event data, it is not possible to fit a saturated logistic regression model for models 4 and 5. Instead, we categorized the survival times into J ⫹ 1 groups (where the time of death is divided into J intervals plus 1 for survivors) and treat the distribution as a multinomial. We also dropped higher order interactions from the logistic regression model. With simplification of the missingness law in these ways, there is a potential increase in bias due to misspecification of the model for weights. The overall goal is to provide a parsimonious yet flexible representation of the missingness mechanism. A more serious complication is that missingness is not monotone: Some subjects are missing DPAX while others are missing GP-DATA. Estimation of model 4, the probability of observing X1, is complicated because X2 is not fully observed. Similar complications arise in the estimation of the probability of observing X2 (pˆ2). In our example, the DPAX report (X2) was not predictive of missingness for GP-DATA, so we dropped X2 from model 4 and estimated pˆ1 from the full sample. This was not the case for model 5, because GP-DATA (X1) were predictive of missingness for

DPAX (X2). We gave weight 1 to observations with missing predictors in the missingness model for pˆ2. RESULTS

The majority of the sample in 1952 is female (54 percent), and the mean initial age (in 1952) was 48.5 (standard deviation, 16.2; range, 17–92) years. Table 1 displays the contingency table representing the GP-DATA and DPAX reports. Among the 1,079 subjects with data from one or both sources, the physicians reported a prevalence rate of psychiatric disorders of 14 percent, while the prevalence using self-report was 12 percent. The agreement between GP-DATA and DPAX was low (44) (␬ ⫽ 0.18). We note that 12 percent (6 of 50) of the subjects with missing GP-DATA reports had positive DPAX reports and that 24 percent (18 of 76) of the subjects with missing DPAX reports had positive GP-DATA reports. We tested whether age, gender, DPAX report, or mortality during the follow-up period was predictive of observing the physician-report for those subjects with a complete DPAX report. None of these factors was significantly associated with observing the physician-report using a likelihood ratio test: ␹25 ⫽ 4.16, p ⫽ 0.53. All p values for main effects were p ⬎ 0.30. These results suggest that missing physician-reports are not inconsistent with random processes and may be missing completely at random. We also tested whether the probability of observing the DPAX diagnosis was associated with age, gender, their interaction, positive GP-DATA report, or mortality during the follow-up period using subjects with a complete GPDATA report. We rejected the null hypothesis that the missingness was missing completely at random (␹27 ⫽ 19.88, p ⫽ 0.006). There was a significant interaction between age and gender (␹22 ⫽ 6.90, p ⫽ 0.03), as well as a significant positive association between the GP-DATA report and DPAX missingness (␹21 ⫽ 5.56, p ⫽ 0.02). Mortality during the follow-up period was not significantly associated with missing the DPAX report (␹21 ⫽ 0.55, p ⫽ 0.46). Table 2 displays the estimated probabilities of missing the DPAX report, by age group, gender, and GP-DATA report for subjects that did not die during the follow-up period. Younger women have lower estimated missingness probability than younger men have, but older women have a higher missingness fraction than older men have, and TABLE 1. Cross-classification of GP-DATA* and DPAX* reports of psychiatric disorders (including partially observed subjects), Stirling County Study, 1952–1968 DPAX GP-DATA

Missing report No disorder Report of disorder

Missing report

No disorder

0 58 18

44 748 90

Report of disorder

6 81 34

Total

50 887 142

Total 76 882 121 1,079 * GP-DATA, general physician data; DPAX, self-reported data on depression and anxiety.

Am J Epidemiol Vol. 154, No. 7, 2001

Multiple Informants of Psychiatric Disorders TABLE 2. Predicted percentage missing DPAX* report, by age, gender, and GP-DATA* report (for subjects that survived the follow-up period), Stirling County Study, 1952–1968 Female GP-DATA

⬍50 years

50–69 years

Male ⱖ70 years

⬍50 years

50–69 years

ⱖ70 years

No disorder 1.3 3.1 4.6 3.2 2.1 3.0 Report of disorder 5.1 11.4 16.2 11.6 8.0 11.1 * DPAX, self-reported data on depression and anxiety; GPDATA, general physician data.

missingness is higher for subjects with a positive GP-DATA report. These results indicate that missingness of the DPAX report is related to factors of interest in the study: age, gender, and physician diagnosis of psychiatric disorder. Regression models for overall mortality

We begin by considering models for the probability of dying during the 16-year follow-up period, given age at baseline, gender, and psychiatric disorder status. The youngest subjects and males were used as the reference groups in the regression models. We first consider separate regression models for GP-DATA and DPAX. To account for the partially observed subjects, we weighted observations using the inverse of the probability of observing that value. We estimated these weights using a series of logistic regression models. For the probability of observing GP-DATA reports, this model included main effects for age (2 df), survival status (4 df), and gender (1 df). For the probability of observing DPAX reports, this model included main effects of age, gender survival status, and GP-DATA report, as well as interactions between gender and age and GP-DATA report and age. The goal is to model the missingness in a relatively rich fashion to avoid introducing bias; hence, some nonsignificant predictors were retained in these regressions. Results for GP-DATA are provided in table 3, where the parameter estimates are log annual mortality rate ratios. To aid in interpreting these results, table 4 displays the estimated annual mortality rate for a younger male (⬍50 years) and an older male (ⱖ70 years) with and without a psychiatric diagnosis using GP-DATA reports during the first time interval. Here the estimated log rate for a younger male subject with no diagnosis is

log共rate兲 ⫽ ⫺5.70 ⫺ 0.95 ⫽ ⫺6.65, and the predicted annual mortality rate is exp(⫺6.65) ⫽ 0.00129. As expected, the force of mortality tends to increase with age. Older subjects and those with a psychiatric diagnosis have a significantly higher rate of mortality, but the association of a psychiatric diagnosis report with mortality is significantly larger for younger subjects. Results for the DPAX separate regression (not reported here) are similar, but there is also a small effect of gender in the model: The difference in log rates for women is ⫺0.224 (95 percent confidence interval: ⫺0.489, 0.041). This may Am J Epidemiol Vol. 154, No. 7, 2001

653

TABLE 3. Piecewise exponential survival regression results using the GP-DATA* report (parameter estimates represent log annual mortality rate ratios, n ⴝ 1,029), Stirling County Study, 1952–1968 Parameter

Estimate (SE*)

95% CI*

Intercept

⫺5.70 (0.27)

⫺6.24, ⫺5.16

Interval 1 2 3 4

⫺0.95 (0.18) ⫺0.53 (0.17) ⫺0.38 (0.16) 0.00

⫺1.30, ⫺0.61 ⫺0.86, ⫺0.20 ⫺0.70, ⫺0.07

Gender Female Male

⫺0.12 (0.13) 0.00

⫺0.37, 0.13

Age (years) ⬍50 50–69 ⱖ70

0.00 2.56 (0.27) 3.75 (0.27)

2.03, 3.09 3.23, 4.28

Diagnosis

1.69 (0.39)

0.93, 2.46

Diagnosis ⫻ age (years) ⬍50 0.00 50–69 ⫺1.15 (0.45) ⫺2.03, ⫺0.26 ⱖ70 ⫺1.17 (0.48) ⫺2.12, ⫺0.22 * GP-DATA, general physician data; SE, standard error; CI, confidence interval.

be due in part to the use of different subsamples used to estimate this model, since older females with positive GPDATA reports were more likely to be missing their DPAX reports. We next considered models using all subjects where the same outcome (indicator of survival and follow-up time) is repeated, but each subject with complete data has two observations (one with the GP-DATA report and one with the DPAX report), and those with a missing report contribute only one. We fit a GEE model using a working independence correlation structure, as displayed in table 5. This model allows the rate to vary by informant. Since the GP-DATA report is the baseline group, the intercept in the GEE model reported in table 5 (⫺5.70) is the same as the intercept for the GP-DATA model reported in table 3. Similarly, the estimated intercept for DPAX reports in table 5 (⫺5.70 ⫹ 0.22 ⫽ ⫺5.48) is the same as the estimate from the separate DPAX model. The estimates of ␣ (for the main effects and interactions) were close to 0, so these nonsignificant terms were dropped from the model. The results from the final GEE model are displayed in table 6. We note that the parameter estimates for this model are roughly comparable to an average of the other models using GP-DATA only or DPAX only but that the standard error estimates of the GEE model are generally smaller. Overall, the GEE model provides a parsimonious summary of the association between diagnosis and overall mortality, while controlling for age and gender. This model also incorporates subjects that are partially observed. To assess the effects of weighting the observations to

654

Horton et al.

TABLE 4. Predicted annual mortality rate for a male subject of two different ages (during interval 1 using the GP-DATA* report), Stirling County Study, 1952–1968 Age (years) ⬍50

GP-DATA 0 0.001 Diagnosis 1 0.007 * GP-DATA, general physician data.

0.056 0.093

DISCUSSION

It has long been suspected that nonresponse patterns are different for those with and without psychiatric problems. TABLE 5. Piecewise exponential survival GEE* models with repeated outcomes and informant effects (parameter estimates represent log annual mortality rate ratios, n ⴝ 1,079), Stirling County Study, 1952–1968 Estimate (SE*)

95% CI*

Intercept

⫺5.70 (0.27)

⫺6.24, ⫺5.16

Interval 1 2 3

⫺0.95 (0.18) ⫺0.53 (0.17) ⫺0.38 (0.16)

⫺1.29, ⫺0.60 ⫺0.85, ⫺0.20 ⫺0.70, ⫺0.07

Gender (female)

⫺0.12 (0.13)

⫺0.37, 0.13

Age (years) 50–69 ⱖ70

2.56 (0.27) 3.75 (0.27)

2.03, 3.09 3.23, 4.27

Diagnosis

1.69 (0.39)

0.93, 2.46

Diagnosis ⫻ age (years) 50–69 ⱖ70

⫺1.15 (0.45) ⫺1.17 (0.48)

⫺2.03, ⫺0.26 ⫺2.12, ⫺0.23

0.22 (0.21)

⫺0.19, 0.63

Diagnosis ⫻ informant (DPAX)

⫺0.28 (0.54)

⫺1.34, 0.78

Gender ⫻ informant (female DPAX)

⫺0.10 (0.07)

⫺0.23, 0.03

Age (years) ⫻ informant (DPAX) 50–69 ⱖ70

⫺0.11 (0.17) ⫺0.19 (0.18)

⫺0.43, 0.22 ⫺0.55, 0.17

Informant (DPAX*)

Estimate (SE*)

95% CI*

Intercept

⫺5.58 (0.25)

⫺6.07, ⫺5.09

Interval 1 2 3 4

⫺0.94 (0.17) ⫺0.52 (0.17) ⫺0.38 (0.16) 0.00

⫺1.28, ⫺0.60 ⫺0.85, ⫺0.19 ⫺0.70, ⫺0.06

Gender Female Male

⫺0.18 (0.13) 0.00

⫺0.43, 0.07

Parameter

ⱖ70

account for the missingness, we compared the results from the weighted models with those from the unweighted models (not reported here). The maximum absolute deviation was 0.082, and the maximum relative deviation was 27.1 percent (for one of the estimates of interval). In this setting, because missingness was not related to the outcome, the regression inferences were similar for the weighted and unweighted estimates.

Parameter

TABLE 6. Piecewise exponential survival GEE* models with repeated outcomes and no informant effects (parameter estimates represent log annual mortality rate ratios, n ⴝ 1,079), Stirling County Study, 1952–1968

Diagnosis ⫻ age (years) ⫻ informant (DPAX) 50–69 ⫺0.07 (0.45) ⫺0.95, 0.81 ⱖ70 0.29 (0.53) ⫺0.75, 1.33 * GEE, generalized estimating equation; SE, standard error; CI, confidence interval; DPAX, self-reported data on depression and anxiety.

Age (years) ⬍50 50–69 ⱖ70

0.00 2.50 (0.24) 3.65 (0.24)

2.03, 2.98 3.18, 4.12

Diagnosis

1.55 (0.31)

0.95, 2.15

Diagnosis ⫻ age (years) ⬍50 0.00 50–69 ⫺1.19 (0.34) ⫺1.87, ⫺0.52 ⱖ70 ⫺1.03 (0.38) ⫺1.77, ⫺0.30 * GEE, generalized estimating equation; SE, standard error; CI, confidence interval.

The National Comorbidity Survey provided evidence for this effect, with people who were initially missing tending to be more ill (6). If not accounted for, biased estimates of prevalence and associations may result. In the Stirling County Study, we showed that missingness was related to demographic factors that are commonly collected (such as age and gender). In addition, missingness of self-reports was predicted by physician-reports of psychiatric disorders: The people you cannot interview in a household survey are not a random subset according to what you know from physicians. Our results were not qualitatively different when accounting for missingness, partly because missingness was not related to overall mortality. However, missingness of selfreports was predicted by physician-reports of psychiatric disorder. According to the physician’s report, the subjects who could not be interviewed were more likely to suffer a psychiatric disorder than were those who were successfully interviewed. This bias appeared despite the high completion rates achieved in the Stirling County Study. Further, it appeared in the unusual situation of having as knowledgeable an informant as a physician give psychiatric information about the randomly chosen subjects. These factors emphasize that missingness and incomplete data are issues that need to be addressed in studies of the epidemiology of psychiatric disorders. In psychiatric epidemiology, researchers are often interested in the results of separate regression models based on different reports of psychiatric states. These models differ Am J Epidemiol Vol. 154, No. 7, 2001

Multiple Informants of Psychiatric Disorders

from approaches whereby both informants might be included in a regression model. Interpretability of the latter models can be problematic: How should one interpret the “effect” of a positive self-report while holding the physician-report constant? Separate regression models for each informant are straightforward to fit but difficult to interpret if they provide differing results. We have proposed methods using a single model that allows for statistical comparison of differences. These methods are broadly applicable to other epidemiologic investigations that utilize multiplesource reports, such as child psychiatric epidemiology, nutritional epidemiology, and geriatric studies. One product of this analysis is the finding that there are similar associations between mortality and either self-report or physician-report of psychiatric disorders. The separate analyses reported earlier pointed in the same direction, but it might have been assumed from them that the similarity was due to the fact that the two sources identified the same subjects (though the agreement between the sources was low). The similarity may seem surprising because the selfreport information is limited to what has sometimes been thought of as “minor psychiatric morbidity” involving depression and anxiety, while the physician-report data cover the full range of psychiatric disorders (45). In recent years, however, considerable evidence has accumulated that points to the fact that disorders identified through self-report are more serious than once realized. The seriousness is indicated not only in mortality risk, as here, but also in chronicity, disability, and use of services (46 – 49).

ACKNOWLEDGMENTS

The authors are grateful for the support provided by National Institute of Mental Health grants R01-MH54693, R01-MH39576, and T32-MH17119.

REFERENCES 1. Srole L, Langner TS, Michael ST, et al. Mental health in the metropolis: the Midtown Manhattan Study. New York, NY: McGraw-Hill, 1962. 2. Phillips DL. The “true prevalence” of mental illness in a New England state. Community Ment Health J 1966;2:35– 40. 3. Comstock GW, Helsing KJ. Symptoms of depression in two communities. Psychol Med 1976;6:551– 63. 4. Epidemiology of psychiatric disorders in Edmonton. Acta Psychiatr Scand Suppl 1988;338:1– 80. 5. Robins LN, Regier DA. Psychiatric disorders in America: the Epidemiologic Catchment Area Study. New York, NY: The Free Press, 1991. 6. Kessler RC, McGonagle KA, Zhao S, et al. Lifetime and 12-month prevalence of DSM-III-R psychiatric disorders in the United States: results from the National Comorbidity Survey. Arch Gen Psychiatry 1994;51:8 –19. 7. Offord DR, Boyle MH, Campbell D, et al. One year prevalence of psychiatric disorder in Ontarians 15 to 64 years of age. Can J Psychiatry 1996;41:559 – 63. 8. Leighton AH. My name is Legion: the Stirling County Study of psychiatric disorder and sociocultural environment. Vol I. New York, NY: Basic Books, Inc, 1959. 9. Murphy JM. Continuities in community-based psychiatric epAm J Epidemiol Vol. 154, No. 7, 2001

655

idemiology. Arch Gen Psychiatry 1980;37:1215–23. 10. Murphy JM, Laird NM, Monson RR, et al. A 40-year perspective on the prevalence of depression: the Stirling County Study. Arch Gen Psychiatry 2000;57:209 –15. 11. Rutter M, Tizard J, Whitmore K. Education, health and behavior. Harlow, United Kingdom: Addison Wesley Longman, Ltd, 1970. 12. Achenbach TM, Edelbrock CS. Behavioral problems and competencies reported by parents of normal and disturbed children aged four through sixteen. Monogr Soc Res Child Dev 1981;46:1– 82. 13. Offord DR, Boyle MH, Szatmari P, et al. Ontario Child Health Study II: six-month prevalence of disorder and rates of service utilization. Arch Gen Psychiatry 1987;44:832– 6. 14. Costello EJ, Angold A, Burns BJ, et al. The Great Smoky Mountains Study of Youth: goals, designs, methods and the prevalence of DSM-III-R disorders. Arch Gen Psychiatry 1996;53:1129 –36. 15. Shaffer D, Fisher P, Dulcan MK, et al. The NIMH Diagnostic Interview Schedule for Children Version 2.3 (DISC-2.3): description, acceptability, prevalence rates, and performance in the MECA Study. Methods for the Epidemiology of Child and Adolescent Mental Disorders Study. J Am Acad Child Adolesc Psychiatry 1996;35:865–77. 16. MacLeod RJ, McNamee JE, Boyle MH, et al. Identification of childhood psychiatric disorder by informant comparisons of clinic and community samples. Can J Psychiatry 1999;44: 144 –50. 17. Horton NJ, Laird NM, Zahner GEP. Use of multiple informant data as a predictor in psychiatric epidemiology. Int J Methods Psychiatr Res 1999;8:6 –18. 18. Achenbach TM, McConaughy SH, Howell CT. Child/ adolescent behavioral and emotional problems: implication of cross-informant correlations for situational specificity. Psychol Bull 1987;101:213–32. 19. Canino G, Bird HR, Rubio-Stipec M, et al. Child psychiatric epidemiology: what we have learned and what we need to learn. Int J Methods Psychiatr Res 1995;5:79 –92. 20. Bird HR, Gould MS, Staghezza B. Aggregating data from multiple informants in child psychiatry epidemiological research. J Am Acad Child Adolesc Psychiatry 1992;31:78 – 85. 21. Offord DR, Boyle MH, Racine Y, et al. Integrating assessment data from multiple informants. J Am Acad Child Adolesc Psychiatry 1996;35:1078 – 85. 22. Feldman SS, Weinberger DA. Self-restraint as a mediator of family influences on boys’ delinquent behavior: a longitudinal study. Child Dev 1994;65:195–211. 23. Bernstein DP, Cohen P, Skodol A, et al. Childhood antecedents of adolescent personality disorders. Am J Psychiatry 1996;153:907–13. 24. Gould MS, Fisher P, Parides M, et al. Psychosocial risk factors of child and adolescent completed suicide. Arch Gen Psychiatry 1996;53:1155– 62. 25. Leighton DC, Harding JS, Macklin DB, et al. The character of danger: the Stirling County Study of psychiatric disorder and sociocultural environment. Vol III. New York, NY: Basic Books, Inc, 1963. 26. Murphy JM, Monson RR, Olivier DC, et al. Affective disorders and mortality: a general population study. Arch Gen Psychiatry 1987;44:473– 80. 27. Murphy JM, Monson RR, Olivier DC, et al. Mortality risk and psychiatric disorders. Soc Psychiatry Psychiatr Epidemiol 1989;24:134 – 42. 28. Zahner GEP, Pawelkiewicz W, DeFrancesco JJ, et al. Children’s mental health service needs and utilization patterns in an urban community. J Am Acad Child Adolesc Psychiatry 1992;31:951– 60. 29. Little RJA, Rubin DB. Statistical analysis with missing data. New York, NY: John Wiley & Sons, Inc, 1987. 30. Zhao LP, Lipsitz SR, Lew D. Regression analysis with missing covariate data using estimating equations. Biometrics 1996;52:1165– 82.

656

Horton et al.

31. Hughes CC, Tremblay MA, Rapoport RN, et al. People of Cove and Woodlot: the Stirling County Study of psychiatric disorder and sociocultural environment. Vol II. New York, NY: Basic Books, Inc, 1960. 32. Murphy JM, Sobol AM, Neff RK, et al. Stability of prevalence: depression and anxiety disorders. Arch Gen Psychiatry 1984;41:990 –7. 33. Murphy JM, Neff RK, Sobol AM, et al. Computer diagnosis of depression and anxiety: the Stirling County Study. Psychol Med 1985;15:99 –112. 34. Laird NM, Olivier D. Covariance analysis of censored survival data using log-linear analysis techniques. J Am Stat Assoc 1981;76:231– 40. 35. Liang K-Y, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika 1986;73:13–22. 36. Zeger SL, Liang K-Y. Longitudinal data analysis for discrete and continuous outcomes. Biometrics 1986;42:121–30. 37. SAS Institute, Inc. SAS/Stat user’s guide: version 7-1. Cary, NC: SAS Institute, Inc, 1999. 38. Pepe MS, Whitaker RC, Seidel K. Estimating and comparing univariate associations with application to the prediction of adult obesity. Stat Med 1999;18:163–73. 39. Horton NJ, Lipsitz SR. Review of software to fit generalized estimating equation (GEE) regression models. Am Statistician 1999;53:160 –9. 40. Xie F, Paik MC. Generalized estimating equation model for

41. 42. 43. 44. 45. 46. 47. 48. 49.

binary outcomes with missing covariates. Biometrics 1997;53: 1458 – 66. Robins JM, Rotnitzky A, Zhao LP. Estimation of regression coefficients when some regressors are not always observed. J Am Stat Assoc 1994;89:846 – 66. Lipsitz SR, Ibrahim JG, Zhao LP. A weighted estimating equation for missing covariate data with properties similar to maximum likelihood. J Am Stat Assoc 1999;94:1147– 60. Efron B, Tibshirani RJ. An introduction to the bootstrap. New York, NY: Chapman & Hall, 1993. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977;33:159 –74. Finlay-Jones RA, Burvill PW. The prevalence of minor psychiatric morbidity in the community. Psychol Med 1977;7:475– 89. Murphy JM, Olivier DC, Sobol AM, et al. Diagnosis and outcome: depression and anxiety in a general population. Psychol Med 1986;16:117–26. Wells KB, Stewart A, Hays RD, et al. The functioning and well-being of depressed patients: results from the Medical Outcomes Study. JAMA 1989;262:914 –19. Broadhead WE, Blazer DG, George LK, et al. Depression, disability days, and days lost from work in a prospective epidemiologic survey. JAMA 1990;264:2524 – 8. Kessler RC, Zhao S, Blazer DG, et al. Prevalence, correlates and course of minor depression and major depression in the National Comorbidity Survey. J Affect Disord 1997;45:19 –30.

Am J Epidemiol Vol. 154, No. 7, 2001