Cancer patients' function, symptoms and supportive care needs: a ...

3 downloads 0 Views 779KB Size Report
Jan 31, 2014 - Cancer patients' function, symptoms and supportive care needs: a latent class analysis across cultures. Authors; Authors and affiliations.
Qual Life Res (2015) 24:135–146 DOI 10.1007/s11136-014-0629-4

QUANTITATIVE METHODS SPECIAL SECTION

Cancer patients’ function, symptoms and supportive care needs: a latent class analysis across cultures Jennifer Barsky Reese • Amanda Blackford • Jonathan Sussman • Toru Okuyama • Tatsuo Akechi Daryl Bainbridge • Doris Howell • Claire F. Snyder



Accepted: 16 January 2014 / Published online: 31 January 2014 Ó Springer International Publishing Switzerland 2014

Abstract Purpose Patient-reported outcomes (PROs) are an umbrella term covering a range of outcomes, including symptoms, functioning, health-related quality of life, and supportive care needs. Research regarding the appropriate PRO questionnaires to use is informative. A previously published latent class analysis (LCA) examined patterns of function, symptoms, and supportive care needs in a sample of US cancer patients. The current analysis investigated whether the findings from the original study were replicated in new samples from different countries and whether a larger sample combining all the data would affect the classes identified. Methods This secondary analysis of data from 408 Japanese and 189 Canadian cancer patients replicated the methods used in the original LCA using data from 117 US

cancer patients. In all samples, subjects completed the EORTC-QLQ-C30 and Supportive Care Needs Survey Short Form-34 (SCNS-SF34). We first dichotomized individual function, symptom, and need domain scores. We then performed LCA to investigate the patterns of domains for each of the outcomes, both in the individual country samples and then combining the data from all three samples. Results Across all analyses, class assignment was made by level of function, symptoms, or needs. In individual samples, only two-class models (‘‘high’’ vs. ‘‘low’’) were generally identifiable while in the combined sample, threeclass models (‘‘high’’ vs. ‘‘moderate’’ vs. ‘‘low’’) best fit the data for all outcomes. Conclusions In this analysis, the level of burden experienced by patients was the key factor in defining classes.

J. B. Reese (&) Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, 5510 Nathan Shock Dr., Suite 100, Baltimore, MD 21224, USA e-mail: [email protected]

D. Howell University Health Network, Princess Margaret Hospital, Toronto, ON, Canada

A. Blackford  C. F. Snyder Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins, Baltimore, MD, USA J. Sussman  D. Bainbridge Department of Oncology, McMaster University, Hamilton, ON, Canada T. Okuyama Division of Palliative Care and Psycho-oncology, Nagoya City University Hospital, Nagoya, Japan

D. Howell Lawrence Bloomberg Faculty of Nursing, University of Toronto, Toronto, ON, Canada C. F. Snyder Division of General Internal Medicine, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA C. F. Snyder Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA

T. Okuyama  T. Akechi Department of Psychiatry and Cognitive-Behavioral Medicine, Nagoya City University Graduate School of Medical Sciences, Nagoya, Japan

123

136

Keywords Patient-reported outcomes  Quality of life  Cancer  Supportive care needs  Latent class analysis

Introduction Patient-reported outcomes (PROs) are an umbrella term for various information collected directly from patients about a health condition or its treatment [1, 2]. Different types of PROs include symptoms, functioning, health-related quality of life (HRQOL), and supportive care needs. Research regarding the appropriate PRO questionnaires to use is informative [3, 4]. A prior study by members of our research team used latent class analysis (LCA) to examine the relationship between symptoms, function, and supportive care needs in individual patients [5]. The LCA results categorized 117 cancer patients in the US as either being high or low need, high or low function, or high or low symptom. Further, the results indicated that different domains contributed differentially to class assignment, supporting the need for an approach such as latent class analysis rather than simply summing domains. This previous study also evaluated whether symptoms, function, and supportive care needs are redundant by examining the concordance of patients’ class assignment for each of the three outcomes. That is, we examined the extent to which patients fell in concordant categories (highfunction/low symptom/low need or low function/high symptom/high need). While a majority of patients (66/117; 56 %) were in concordant categories, among patients with a deficit in at least one area (n = 68), concordance in classifications was much lower; 75 % had some discordance in their classifications. These findings suggest that using scores in one outcome (e.g., symptoms) to predict scores in another (e.g., function) may not be appropriate. While informative, the previous study was limited by a relatively small sample size and subjects recruited from a single academic institution in the US. Because of the small sample size, it is not clear whether the two-class models were found because they best represent the clinical phenomena under study or because the sample size did not allow for the identification of larger (e.g., three-class) models. For example, an alternative classification with more than two classes could have categorized patients based on clusters of symptoms rather than levels of symptom burden (e.g., patients with gastrointestinal symptoms, patients with fatigue, and patients without symptoms). Further, because the sample was recruited from a single institution, the generalizability of the findings was unclear. The primary objective of the current study was to examine whether the findings from the original LCA in the US would be replicated in samples of cancer patients from

123

Qual Life Res (2015) 24:135–146

Canada and Japan. Specifically, we used LCA to investigate a) whether classification of function, symptoms, and needs in the Japanese and Canadian samples followed similar patterns as the original analysis, and b) whether analysis using a larger data set combining all three samples enabled LCA solutions with [2 classes. We expected that there would be general consistency across samples in the types of classes found but that larger data sets might allow models with more than two classes to emerge. Because our primary research question was concerned with examining the replication of the original findings, we employed the same methodological approach for the current study as was used in the original study.

Methods Study population This secondary analysis compared data collected in prior studies in Japan and Canada to a previously published latent class analysis of data from the US [5]. In all three studies, subjects completed the European Organization for Research and Treatment of Cancer (EORTC), Quality of Life Questionnaire Core 30 (QLQ-C30), and the Supportive Care Needs Survey Short Form-34 (SCNS-SF34). All studies were approved by their respective institutional review boards and conducted in accordance with the 1964 Declaration of Helsinki principles. This secondary data analysis used de-identified data and was considered exempt. Each of the study samples is described briefly below. US sample The original latent class analysis included data from 117 breast, prostate, and lung cancer patients currently undergoing treatment and age 18 years or older recruited from the Johns Hopkins Sidney Kimmel Comprehensive Cancer Center between January and May 2006. Detailed information on the study sample has been reported elsewhere [5, 6]. Briefly, the mean age of the 117 patients was 61 years, 49 % were female, 77 % were white, and 95 % had an Eastern Cooperative Oncology Group (ECOG) performance status of 0 or 1, reflecting good function. Most patients had breast (43 %) or prostate (41 %) cancer with fewer lung cancer participants (16 %). Canadian sample The Canadian data were drawn from a prospective, multisite, clustered randomized controlled trial and included newly diagnosed breast or colorectal cancer patients undergoing surgery in greater Toronto. This analysis used

Qual Life Res (2015) 24:135–146

the baseline data (i.e., within 7 days of discharge from the hospital following surgery) from 189 subjects with a mean age of 60 years, 21 % of whom were male. The majority had breast cancer (63 %) and the remainder, colorectal cancer (37 %). Data were collected between December 2005 and February 2009. Japanese sample The Japanese data were originally collected as part of the validation study of the Japanese version of the SCNS-SF34 [7]. The patient sample consisted of 408 ambulatory patients diagnosed with breast cancer with a mean age of 56 and attending the outpatient oncology, immunology, and surgery clinic of Nagoya City University Hospital. These women were in various stages of the cancer trajectory, ranging from 11 to 17,915 days from diagnosis (median 701). Data were collected between February 2006 and April 2007. Measures Supportive Care Needs Survey Short Form-34 The SCNS-SF34 is a self-reported patient measure, assessing the perceived needs of cancer patients [8, 9]. Respondents indicate their level of need for help over the last month across five domains of need: psychological, health system and information, physical and daily living, patient care and support, and sexuality. The following five response options are used: 1 = not applicable (no need); 2 = satisfied; 3 = low need; 4 = moderate need; 5 = high need. Domain scores were calculated by averaging the scores of items within each domain. EORTC-QLQ-C30 The EORTC-QLQ-C30 is a 30-item measure that has been used to assess quality of life in cancer patients in over 3,000 studies worldwide [10]. This measure includes five function subscales (physical, role, emotional, social, and cognitive), eight symptoms (fatigue, pain, nausea and vomiting, dyspnea, insomnia, appetite loss, constipation, and diarrhea), and measures of financial impact and global health/quality of life. All domains are transformed to a 0–100 score, with higher scores on the function domains representing better function and higher scores on the symptom domains representing greater symptom burden. Statistical analysis We used latent class analysis to identify patterns in function, symptoms, and supportive care needs within patients. Latent class analysis models the relationship between discrete

137

observed variables (in this case cancer-related function, symptoms, and supportive care needs) and a categorical latent variable that is not directly measurable [11]. To perform the latent class analysis using the Canadian and Japanese samples, we used the same latent class analysis methods as we used in the original US study [5]. First, we dichotomized the individual domain scores for each of the outcomes to reduce the potential number of possible response patterns and improve the ability to identify classes. We used identical criteria for dichotomization as were used in the initial study. For the five function and eight symptom domain scores from the QLQ-C30, we used as the cut-off mean scores from 1,965 Norwegians sampled at random from the general population and published in the EORTC reference manual [12]. Because the way the SCNS-SF34 is scored, average scores B2.0 represent no unmet need and scores [2.0 represent some level of unmet need. We used 2.0 as the cut-off, thus differentiating between patients with and without unmet needs. Latent class models were then fit to each of the three sets of outcomes (i.e., function, symptoms, and needs). The results of the latent class analyses were then used to categorize patients into classes based on reported patterns of function, symptoms, and needs. Latent class analysis uses information about the patterns in the data to assign individuals to classes and allows each domain to have a different relationship with class assignment. Unlike simply summing domains to create a scale, which assumes each domain has equal weight, latent class analysis utilizes prevalence information for each domain to help distinguish between classes and allows certain domains to be more or less informative of class assignment. For each set of outcomes, two- and three-class models were estimated. Model identifiability was evaluated by comparing point estimates between models and examining the width of posterior intervals of model parameters. Generally, due to limitations in sample size, we fit only two-class models. However, for outcomes where both twoand three-class models were identifiable, the appropriate number of classes was determined by calculating the Bayesian information criteria (BIC) for each model and selecting the model with the lowest BIC. A Bayesian estimation procedure was chosen because it provides posterior probabilities of class membership. Specifically, from the model results, posterior probabilities were estimated for each subject, defined as the probability of class membership for each class. The posterior probabilities were then used to categorize patients into different classes based on observed patterns of function, symptoms, and needs. Specifically, each patient was placed into one class for each of the three outcomes using the highest posterior probability for that subject. After conducting latent class analyses for each country separately, we then combined the data from the three datasets to determine whether the increased

123

138

Qual Life Res (2015) 24:135–146

sample size would allow for identification of models with more classes. Latent class models assume conditional independence, i.e., that there should be no association between function, symptom, or need domains within classes. Thus, we examined all of the possible two-by-two tables of function, symptom, and need domains within classes to test for associations using Fisher’s exact test. Due to the number of comparisons (144 domain pairs), we considered p values \0.001 to be statistically significant. While this is a small p value, it is more conservative than a Bonferroni-corrected p value of 0.0003 and would produce more combinations that could violate conditional independence. Latent class analyses were performed using WinBUGS and a Bayesian estimation procedure. A technical appendix with the specification of the Bayesian analyses is provided as supplementary material. Other statistical analyses were performed using R version 2.15.1.

(a)

Results

0.5

1 0.8 0.6 0.4 0.2 0

Physical

Role

Emo onal

Low funcon U.S. Low funcon Japan High funcon Canada

Cogn ve

Social

Low funcon Canada High funcon U.S. High funcon Japan

(b) 1

Latent class analysis of function, symptoms, and supportive care needs in individual samples 0

The results of the Canadian and Japanese latent class analyses largely replicated the findings from the original US sample in that class assignment were made by level of function, symptoms, or needs (‘‘high’’ vs. ‘‘low’’ in 2-class models; ‘‘high’’ vs. ‘‘moderate’’ vs. ‘‘low’’ in 3-class models). Also, as in the prior analyses, only two-class models (‘‘high’’ vs. ‘‘low’’) were identifiable with the exception of symptoms in the larger Japanese sample, in which a three-class model was identified (‘‘high’’ vs. ‘‘moderate’’ vs. ‘‘low’’). In the US and Japanese samples, the majority of patients were in the ‘‘high-function’’ class (59 % for both); in the Canadian sample, only 40 % were categorized as ‘‘high-function.’’ Similarly, 62 % of the US sample fell in the ‘‘lowsymptom’’ class compared to 36 % in the Canadian sample. The percentage of patients in the ‘‘low-,’’ ‘‘moderate-,’’ and ‘‘high-symptom’’ classes in the Japanese sample were 48, 35, and 17 %, respectively. Finally, the US sample was most likely to be classified as ‘‘low-need’’ (70 %) compared to 56 % in the Canadian sample and 49 % in the Japanese sample. Figure 1 shows the prevalence of worse performance for the different function, symptoms, and supportive care needs classes across the three samples based on the model classes identified. As can be seen in Fig. 1a, b, c, there was generally good differentiation between or among the classes across all three samples, and differences in prevalence between or among classes ranged considerably (with wider differences revealing greater differentiation in class by that domain). For instance, in the function analyses, emotional

123

High symptom U.S. High symptom Japan Low symptom U.S. Low symptom Japan

High symptom Canada Moderate symptom Japan Low symptom Canada

(c) 1 0.8 0.6 0.4 0.2 0

Phys

Psy High need U.S. High need Japan Low need Canada

PCS

HSInf

SX

High need Canada Low need U.S. Low need Japan

Fig. 1 Prevalence of worse performance in the two/three classes across the different samples. a Prevalence of worse function in the two classes. b Prevalence of worse symptoms in the two/three classes. c Prevalence of higher needs in the two classes. Note Naus/Vom nausea/vomiting, Dysp dyspnea, App appetite, Const constipation, Diarr diarrhea, Phys physical and daily living, Psy psychological, PCS patient care and support, HSInf health system and information, SX sexuality

Qual Life Res (2015) 24:135–146

function differentiated very well between the ‘‘high’’ and ‘‘low’’ classes in all three samples. Role and physical function did not differentiate well for the Canadian sample though they did for the other samples; social function generally did not differentiate well. In the symptom analyses, fatigue differentiated well between ‘‘high-’’ and ‘‘low-symptom’’ classes across the three samples while several symptoms (e.g., diarrhea, constipation) did not differentiate well. In the needs analyses, psychologic and health system and information needs differentiated well between ‘‘high-’’ and ‘‘low-needs’’ groups, while sexuality needs differentiated well only for the US sample; sexuality needs differentiated particularly poorly in the Japanese sample because these needs were very uncommon for both classes. While in all cases classification was made based on the level of the outcome, there were differences across samples in the prevalence levels. For instance, as shown in Fig. 1a, the prevalence for those in the ‘‘low-function’’ classes was similar for all domains in all three samples, suggesting that the ‘‘low-function’’ classification was relatively similar across the samples. By contrast, with the exception of emotional function, the prevalence ranged considerably for those in the ‘‘high-function’’ classes across the samples, suggesting differences across samples in the definition of ‘‘high-function.’’ From the model results, posterior probabilities were estimated for each subject. An examination of the posterior probability (i.e., the probability of class membership obtained from model results) for each subject for function, symptoms, and supportive care needs within each sample showed similarities with the prior study. Figure 2 shows the posterior probability of being assigned to the poorperforming class for each of the three outcomes in the US [5], Canadian, and Japanese samples. Although some patients fall in the middle (e.g., Canadian ‘‘low-function’’ class), most patients had either a very high ([0.90) or very low probability (\0.10) of being assigned to a particular class. This suggests good differentiation with most patients having a very high or low probability of being assigned to the poorly performing class (i.e., ‘‘low-function,’’ ‘‘highsymptom,’’ ‘‘high-need’’). Among 288 pairs within classes across all analyses in all three samples, only one pair within classes tested was significantly associated with each other at a threshold of p \ 0.001, suggesting that the assumption of conditional independence holds.

Latent class analysis of function, symptoms, and supportive care needs in combined sample In analyses combining all three samples, a three-class model fit better than a two-class model for each outcome.

139

Figure 3 displays the prevalence of worse performance in the combined sample for each of the three classes (‘‘high,’’ ‘‘moderate,’’ and ‘‘low’’). That a three-class model was found, suggests that with a larger sample size, a greater number of classes were able to be identified. As in the individual samples, classes were generally assigned based on level rather than type of burden. Thus, the increased sample size allowed for a more granular classification of level of burden but did not lead to stark differences in the types of classes identified. However, there were some nuances; for instance, one different interpretation of the function analyses in the combined sample might be: ‘‘consistent low function,’’ ‘‘low physical/role functions but moderate others,’’ and ‘‘high physical/role functions but moderate others.’’ The plurality of patients was categorized as ‘‘highfunction’’ (47 %), ‘‘moderate-symptom’’ (40 %), and ‘‘low-need’’ (45 %). The percentage of patients in the classes with poorest performance (i.e., ‘‘low-function,’’ ‘‘high-symptom,’’ and ‘‘high-need’’) was 37, 22, and 35 %, respectively. Finally, the percentages of patients in the remaining classes (‘‘moderate-function,’’ ‘‘low-symptom,’’ and ‘‘moderate-need’’) were 16, 38, 20 %. We examined the percentage of US, Canadian, and Japanese patients comprising the classes in the combined analyses. For needs, each sample contributed roughly the same percentage of patients to each level (e.g., 25, 37, and 26 % of Canadian, Japanese and US patients were classified as ‘‘high-need,’’ respectively). However, for function and symptoms, more Canadian patients were assigned to poorly performing classes in the combined sample relative to the other samples. Specifically, more Canadian patients were classified as ‘‘low-function’’ (57 %) compared to the Japanese and US samples (29 and 32 %, respectively) and as ‘‘high-symptom’’ (30 %) compared to these other samples (16 and 23 %, respectively). We also compared the classification of patients in the individual country and combined analyses and found that there was general consistency among the classifications (e.g., patients categorized in the worse class in the individual country analyses tended to be categorized in the worse class in the combined analyses). As shown in Fig. 3a, in the function analysis, the ‘‘lowfunction’’ class had a similarly high prevalence of poor function across all domains, while the prevalence of poor function ranged considerably across domains for both the ‘‘moderate-’’ and ‘‘high-function’’ classes. By contrast, as shown in Fig. 3b, c representing the symptom and needs analyses, respectively, the ‘‘moderate-symptom’’ and ‘‘moderate-need’’ classes had a prevalence for all domains that fell between the two extremes. Fatigue and sleep symptoms were highly prevalent (C0.62) in both the ‘‘moderate-’’ and ‘‘high-symptom’’ classes in the

123

140

Qual Life Res (2015) 24:135–146

(c)

(e)

(f)

60 20

40

Frequency

60 20

40

Frequency

50 40 30 20

0

0

0

10

Frequency

60

80

70

(d)

(b)

80

(a)

0.0

0.2

0.4

0.6

0.8

0.0

1.0

0.2

0.4

0.6

0.8

0.0

1.0

Posterior probability of being in the high symptoms class

Posterior probability of being in the low functioning class

0.2

0.4

0.6

0.8

1.0

Posterior probability of being in the high needs class

(h)

100

Frequency

0

0

50

100 50

0.4

0.6

0.8

0.0

1.0

0.4

0.8

1.0

200 150 0

0

50

50

100

Frequency

Frequency

250

250

300

300

(k)

250 200 150

Frequency

100 50 0 0.0

0.2

0.4

0.6

0.8

Posterior prob of high symptoms class

1.0

0.0

0.2

0.4

0.6

0.8

1.0

Posterior prob of moderate symptoms class

symptoms analysis; similarly, psychologic needs were highly prevalent (C0.81) in both the ‘‘moderate-’’ and ‘‘high-need’’ classes in the needs analysis. Sexual needs

123

0.6

(j) 300

(i)

0.2

Posterior probability of being in the high needs class

200

0.2

Posterior probability of being in the low functioning class

150

0.0

100

Frequency

150

150

200

(g)

0.0

0.2

0.4

0.6

0.8

1.0

Posterior prob of low symptoms class

were the least common of all needs overall, but they were nearly twice as common in the ‘‘high-need’’ class as in the ‘‘moderate-need’’ class.

Qual Life Res (2015) 24:135–146 b Fig. 2 Histograms showing the distributions of model-based poster-

ior probabilities of being assigned to the classes first for the US sample (N = 117; From Snyder, Garrett-Mayer, Blackford et al. 2009; reprinted with permission) [5], followed by the Canadian sample (N = 189) and Japanese sample (N = 407). Each value represents the probability that a particular individual is in that particular class, given his or her pattern of responses to the QLQ-C30 and SCNS items. Values close to 0 and 1 represent good classification, while values near 0.50 represent class assignment based on chance. Class size is presented as the proportion of the sample (out of 100). a Posterior probabilities of being in the low function class in the US sample (size = 0.60). b Posterior probabilities of being in the high symptom class in the US sample (size = 0.64). c Posterior probabilities of being in the high need class in the US sample (size = 0.56). d Posterior probabilities of being in the low function class in the Canadian sample (size = 0.60). e Posterior probabilities of being in the high symptom class in the Canadian sample (size = 0.64). f Posterior probabilities of being in the high need class in the Canadian sample (size = 0.56). g Posterior probabilities of being in the low function class in the Japanese sample (size = 0.41). h Posterior probabilities of being in the high need class in the Japanese sample (size = 0.51). i Posterior probabilities of being in the high symptom class in the Japanese sample (size = 0.17). j Posterior probabilities of being in the moderate symptom class in the Japanese sample (size = 0.35). k Posterior probabilities of being in the low symptom class in the Japanese sample (size = 0.48)

141

(a) 1 0.8 0.6 0.4 0.2 0

Physical

Role

Emo onal Cogn ve

Low funcon High funcon

Social

Moderate funcon

(b) 1 0.8 0.6 0.4 0.2

Figure 4 shows the posterior probability of being assigned to classes for function (a), symptoms (b), and supportive care needs (c) in the combined sample. For function, most patients have very high ([0.90) or very low (\0.10) probabilities of being assigned to the ‘‘high-’’ or ‘‘low-function’’ classes. A much smaller number of patients fell in the ‘‘moderate’’ class (16 %), and most patients had a very low (\0.10) probability of being in the ‘‘moderate-function’’ class but this suggests that the patients assigned to this class are done so appropriately. The probability of being in the ‘‘low-symptom’’ or ‘‘lowneed’’ classes was reasonably well differentiated (either \0.10 or [0.90). The difference between the ‘‘moderate-’’ and ‘‘high-symptom’’ classes was less distinguishable. For those assigned to the ‘‘moderate-’’ or ‘‘high-need’’ classes, the probability of being in the class was not as strong in that fewer patients had a probability [0.80. There was a clear division between the strong probabilities of not being in the ‘‘high-’’ or ‘‘moderate-need’’ class (\0.10) or suggestion of being in either ([0.10). Altogether, only 11 out of 144 pairs within classes tested were significantly associated with each other at a threshold of p \ 0.001, so there was no evidence against conditional independence.

Discussion The current study examined whether cancer patients’ patterns of function, symptoms, and supportive care needs

0

High symptom Low symptom

Moderate symptom

(c) 1 0.8 0.6 0.4 0.2 0

Phys

Psy High need Low need

PCS

HSInf SX Moderate need

Fig. 3 Prevalence of worse performance in the three classes in the combined sample (N = 687). a Prevalence of worse function in the three classes. b Prevalence of worse symptoms in the three classes. c Prevalence of higher needs in the three classes. Note Naus/Vom nausea/vomiting, Dysp dyspnea, App appetite, Const constipation, Diarr diarrhea, Phys physical and daily living, Psy psychological, PCS patient care and support, HSInf health system and information, SX sexuality

123

142

Qual Life Res (2015) 24:135–146

(a) Class size=.16

0.2

0.4

0.6

0.8

0.0

1.0

Posterior prob of high functioning class

300

Frequency

100 0

100 0 0.0

200

300

Frequency

400

400

500

500

Class size=.37

200

300 200 0

100

Frequency

400

500

Class size=.47

0.2

0.4

0.6

0.8

1.0

0.0

0.2

0.4

0.6

0.8

1.0

Posterior prob of low functioning class

Posterior prob of moderate functioning class

(b)

0.2

0.4

0.6

0.8

500

Frequency 0.0

1.0

200 100 0

100 0

0 0.0

300

400

500 300 200

Frequency

400 300 200 100

Frequency

Class size=.38

Class size=.40

400

500

Class size=.22

0.2

0.4

0.6

0.8

1.0

0.0

Posterior prob of moderate symptoms class

Posterior prob of high symptoms class

0.2

0.4

0.6

0.8

1.0

Posterior prob of low symptoms class

(c)

0.2

0.4

0.6

0.8

Posterior prob of high needs class

1.0

Class size=.45

500

500

300 100

200

Frequency

400

400 300 100

0

0 0.0

123

Class size=.20

200

Frequency

300 200 0

100

Frequency

400

500

Class size=.35

0.0

0.2

0.4

0.6

0.8

Posterior prob of moderate needs class

0.0

0.2

0.4

0.6

0.8

Posterior prob of low needs class

1.0

Qual Life Res (2015) 24:135–146 b Fig. 4 Histograms showing the distributions of model-based poster-

ior probabilities of being assigned to each class in the combined sample (N = 687). Each value represents the probability that a particular individual is in that particular class, given his or her pattern of responses to the QLQ-C30 and SCNS items. Values close to 0 and 1 represent good classification, while values near 0.50 represent class assignment based on chance. Class size is presented as the proportion of the combined sample (out of 100). a Posterior probabilities of being in the high-, moderate-, and low-function classes in the combined sample. b Posterior probabilities of being in the high-, moderate-, and low-symptom classes in the combined sample. c Posterior probabilities of being in the high-, moderate-, and lowneed classes in the combined sample

were consistent across cultures by comparing whether results of a latent class analysis conducted previously with data from cancer patients in the US [5] would be replicated in samples of cancer patients from Canada and Japan. Indeed, the results of the Japanese and Canadian data analyses were generally consistent with the US findings in that patients were classified based on their level of burden in each of the three outcomes. There were interesting differences in the prevalence of poor (and high) performance, suggesting some variation in the specific classes across the three samples. In other words, what it means to be classified as ‘‘high-,’’ ‘‘low-,’’ or ‘‘moderately-’’ performing may differ in different patient groups. Further, the larger sample size in the analyses combining all three samples allowed three-class models to emerge, but these classifications were still generally based on the level of burden. That we found similarities in the classification of patients across cultures and clinically diverse samples, as well as in the separate and pooled analyses, supports the findings, even though the specific performance of groups across domains might differ in different samples. Although findings from the study generally supported a method of classification based on level of burden, there were some nuances in our findings that may suggest symptom clusters. Symptom clusters are defined as the presence of three or more concurrent symptoms that are related and may or may not have a similar etiology [13]. For instance, classification of patients from the combined sample function analysis might be interpreted as ‘‘consistent low-function,’’ ‘‘low physical/role function but moderate others,’’ and ‘‘high physical/role function/moderate others’’ as opposed to ‘‘low’’, ‘‘moderate,’’ and ‘‘high function’’. Given that the identification of symptom clusters has been shown to differ across research samples due to differences in characteristics of the study population (e.g., cancer site, cancer stage, phase in treatment) [14], it is possible that the heterogeneity in the three cross-cultural samples in the current study may have limited our ability to find symptom clusters. In future studies, larger samples using relatively homogeneous groups allowing for a larger number of classes to be tested might be better able to

143

determine whether patients’ function, symptoms, and needs are best classified according to level of burden or symptom clusters. The findings of this study should be interpreted in the context of its strengths and limitations. Important strengths include the inclusion of patients with varying diagnoses, stages of treatment, and countries of origin. While the study sample was quite large when combined, it may not have been large enough to find the presence of symptom clusters in a heterogeneous group. Another major limitation is that differences in demographic and clinical characteristics across the countries in the current study limited our ability to compare whether patients across cultures were similar in their classifications. That is, had this analysis included patient samples at similar points in the cancer trajectory, we could have compared whether patients across cultures experienced similar levels of function, symptoms, and needs. Because of the heterogeneity of our samples, it is unclear whether differences in class sizes across countries were due to clinical reasons alone or whether culture also played a role. For instance, class sizes of the more poorly performing classes were larger in the Canadian sample, and a greater percentage of this sample made up the poorly performing patient classes in the combined analysis, but this sample was recruited immediately after surgery and therefore could be expected to report higher symptom severity or higher supportive care needs. Additionally, taboos against reporting sexuality needs may have contributed to the low self-reported rate of sexuality needs found for the Japanese sample although this would need to be supported by future research. Finally, given that treatments can be a key driver of the patient perspective, future studies should gather complete treatment data—which we did not have—to allow for a detailed analysis of how treatment plays a role in patients’ function, symptoms, and supportive care needs and their classification in these outcomes. An additional limitation is that this analysis did not allow us to examine the concordance of patients’ symptoms, function, and needs, as we did in the prior study [5]. As the sample size and therefore the number of classes in the models grew in the current study, it became increasingly difficult to define what classifications would be considered ‘‘concordant.’’ For instance, is moderate function concordant only with moderate symptoms and needs? Continued research examining the associations among cancer patients’ function, symptoms, and supportive care needs using the finer gradations that emerged from this study could clarify our understanding of these relationships. In summary, this analysis supported the findings of our original study that the levels of deficits patients experience in their symptoms, functioning, and supportive care needs

123

144

Qual Life Res (2015) 24:135–146

are the key factors in classification. Findings support the use of latent class analysis as an informative method for understanding patterns in cancer patients’ function, symptoms, and needs. Changes to the methodological or statistical approach employed, including using different cut-off scores to dichotomize performance or employing other statistical methods (e.g., latent profile analysis), could potentially lead to different classifications and should be investigated in future research. Further, future research using longitudinal designs and interventions will allow investigation of how patients’ classifications change over time or whether they are amenable to change through interventions (e.g., symptom management, cognitive behavior therapy) [15, 16]. Acknowledgments The authors wish to thank Elizabeth GarrettMayer, PhD, for her valuable review of a preliminary draft and for assistance with revising the manuscript. Dr. Snyder is supported by the American Cancer Society (MRSG-08-011-01-CPPB) and is a member of the Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins (P30CA006973). The original Japanese data collection was supported in part by Grants-in-Aid for Cancer Research and the Third Term Comprehensive 10-Year Strategy for Cancer Control from the Ministry of Health, Labour and Welfare, Japan. The original Canadian data collection was supported by Canadian Institutes of Health Research and Ontario Ministry of Health & Long-term Care.

Technical appendix The latent class analysis for our study was completed using a Bayesian estimation procedure in WinBUGS 14. The first step in running the analysis is to specify a model. Separate code specifying a unique model is required for each combination of sample (Japanese, US, Canadian, combined), domain (needs, symptoms, functioning), and class number (two or three). The two-class model for the Japanese data for symptoms is shown below as an example.

Editing the inputs for k (number of items), m (number of classes), and N (number of patients) would customize the code for each unique combination required. Once the model is specified, the user loads the data into WinBUGS. The data are denoted by y and are a binary matrix with k rows and N columns, denoting whether each patient had reported each symptom (0 represents no symptom and 1 indicates presence of the symptom). Each patient’s probability of reporting a given symptom is assumed to follow a binomial distribution with probability p. The logit (p) is estimated from the matrix gamma, which is sampled from a N (0,0.45) distribution separately for each item k and class m. Starting values are required for gamma, and we gave values of 0.25 for all model combinations. The probability of being in class m is estimated by pi which is specified by parameter alpha. The estimate of alpha is made by drawing from a N (0,0.45) distribution. A starting value for alpha is required, and we used starting values of 0.25 for all model combinations. Once the model, data and starting values are loaded, WinBUGS asks for the number of burn-in iterations for model estimation. We ran 10,000 iterations per model combination. The output from WinBUGS for the example shown above looks like:

Node

Mean

SD

MC error

2.50 %

Median

97.50 %

Start

Sample

p[1,1]

0.745

0.3261

0.004211

0.1969

0.9499

0.9876

1

10,000

p[1,2]

0.2097

0.1066

0.001235

0.036

0.2539

0.3481

1

10,000

p[1,3]

0.5213

0.2749

0.003348

0.0755

0.6746

0.7821

1

10,000

p[1,4]

0.4853

0.2462

0.003022

0.0826

0.6199

0.7248

1

10,000

p[407,6]

0.4954

0.0710

0.001009

0.3992

0.4996

0.5978

1

10,000

p[407,7]

0.5357

0.0502

5.187E-4

0.4423

0.5382

0.6191

1

10,000

p[407,8]

0.2946

0.0447

4.982E-4

0.216

0.295

0.3745

1

10,000

pi[1]

0.404

0.0356

8.882E-4

0.3332

0.4041

0.4732

1

10,000

pi[2]

0.596

0.0356

8.882E-4

0.5269

0.5959

0.6668

1

10,000

tau[1,1]

0.9626

0.0198

3.431E-4

0.9157

0.9657

0.9918

1

10,000

tau[1,2]

0.2464

0.0392

9.36E-4

0.1713

0.2462

0.3254

1

10,000

tau[2,1]

0.2783

0.0383

6.751E-4

0.2073

0.2766

0.3575

1

10,000

tau[2,2]

0.0548

0.0153

1.757E-4

0.0287

0.0535

0.0879

1

10,000

123

Qual Life Res (2015) 24:135–146

145

continued Node

Mean

SD

MC error

2.50 %

Median

97.50 %

Start

Sample

tau[3,1]

0.7055

0.0458

0.0010

0.6156

0.7056

0.796

1

10,000

tau[3,2]

0.1068

0.0240

4.31E-4

0.0631

0.1055

0.157

1

10,000

tau[4,1]

0.6467

0.0449

8.16E-4

0.5589

0.6464

0.7357

1

10,000

tau[4,2]

0.11

0.0247

4.155E-4

0.0661

0.1087

0.1616

1

10,000

tau[5,1]

0.7927

0.0387

7.449E-4

0.7142

0.7933

0.8639

1

10,000

tau[5,2]

0.2424

0.0324

5.541E-4

0.1805

0.2416

0.3078

1

10,000

tau[6,1]

0.5018

0.0469

8.694E-4

0.4143

0.5004

0.598

1

10,000

tau[6,2]

0.0548

0.0168

2.547E-4

0.0249

0.0530

0.0903

1

10,000

tau[7,1]

0.5392

0.0415

4.611E-4

0.4591

0.539

0.6197

1

10,000

tau[7,2] tau[8,1]

0.3037 0.2977

0.0318 0.0377

4.452E-4 4.592E-4

0.2419 0.2282

0.3033 0.2959

0.3668 0.3748

1 1

10,000 10,000

tau[8,2]

0.1011

0.0208

2.963E-4

0.0636

0.1001

0.1444

1

10,000

The matrix p, with dimension 407 by 8, represents the probability of a specific patient reporting a specific symptom. The summary statistics [mean (SD), median (2.5 and 97.5 percentiles)] of the values from all 10,000 iterations are shown. Similarly, there are estimates of pi[1] and pi[2], which sum to 100 % and are the estimates we reported for the probability of class assignment. The three-class model would also include pi[3]. The estimates for tau represent the probability of reporting a specific symptom given class assignment and are the values reported in the paper. These estimates are particularly helpful in determining whether the model fit is adequate. For example, models where the 2.5–97.5 % values for tau covered the whole range from 0 to 1, or close to it, suggested poor model fit. The three-class model for symptoms for the Japanese data is shown below. The three-class model requires an extra specification of pi[3] and alpha[2] but otherwise the model specification and assumptions are the same as for the two-class model.

References 1. Acquadro, C., Berzon, R., Dubois, D., Leidy, N. K., Marquis, P., Revicki, D., et al. (2003). Incorporating the patient’s perspective into drug development and communication: an ad hoc task force report of the patient-reported outcomes (PRO) Harmonization Group meeting at the Food and Drug Administration, February 16, 2001. Value in Health, 6(5), 522–531. 2. U.S. Food and Drug Administration (2009). Guidance for industry: Patient reported outcome measures: Use in medical product development to support labeling claims. 3. Snyder, C. F., Dy, S. M., Hendricks, D. E., Brahmer, J. R., Carducci, M. A., Wolff, A. C., et al. (2007). Asking the right questions: investigating needs assessments and health-related quality-of-life questionnaires for use in oncology clinical practice. Supportive Care in Cancer, 15(9), 1075–1085. 4. Snyder, C. F., Jensen, R. E., Geller, G., Carducci, M. A., & Wu, A. W. (2010). Relevant content for a patient-reported outcomes questionnaire for use in oncology clinical practice: Putting doctors and patients on the same page. Quality of Life Research, 19(7), 1045–1055. 5. Snyder, C. F., Garrett-Mayer, E., Blackford, A. L., Brahmer, J. R., Carducci, M. A., Pili, R., et al. (2009). Concordance of cancer patients’ function, symptoms, and supportive care needs. Quality of Life Research, 18(8), 991–998. 6. Snyder, C. F., Garrett-Mayer, E., Brahmer, J. R., Carducci, M. A., Pili, R., Stearns, V., et al. (2008). Symptoms, supportive care needs, and function in cancer patients: How are they related? Quality of Life Research, 17(5), 665–677. 7. Okuyama, T., Akechi, T., Yamashita, H., Toyama, T., Endo, C., Sagawa, R., et al. (2009). Reliability and validity of the Japanese version of the short-form supportive care needs survey questionnaire (SCNS-SF34-J). Psychooncology, 18(9), 1003–1010. 8. Bonevski, B., Sanson-Fisher, R., Girgis, A., Burton, L., Cook, P., & Boyes, A. (2000). Evaluation of an instrument to assess the needs of patients with cancer. Supportive Care Review Group. Cancer, 88(1), 217–225. 9. Sanson-Fisher, R., Girgis, A., Boyes, A., Bonevski, B., Burton, L., & Cook, P. (2000). The unmet supportive care needs of patients with cancer. Supportive Care Review Group. Cancer, 88(1), 226–237. 10. Aaronson, N. K., Ahmedzai, S., Bergman, B., Bullinger, M., Cull, A., Duez, N. J., et al. (1993). The European Organization for Research and Treatment of Cancer QLQ-C30: A quality-of-life

123

146 instrument for use in international clinical trials in oncology. Journal of the National Cancer Institute, 85(5), 365–376. 11. Lanza, S. T., Bray, B. C., Collins, L. M. (2013). An introduction to latent class and latent transition analysis. In I. B. Weiner (Ed.), Handbook of psychology: Research methods in psychology (Second Edition, Vol. 2, pp. 691–716). Hoboken: John Wiley & Sons, Inc. 12. Fayers, P. M., Weeden, S., Curran, D., on behalf of the EORTC Quality of Life Study Group. (1998). EORTC reference values. Brussels: EORTC. 13. Dodd, M. J., Miaskowski, C., & Paul, S. M. (2001). Symptom clusters and their effect on the functional status of patients with cancer. Oncology Nursing Forum, 28(3), 465–470.

123

Qual Life Res (2015) 24:135–146 14. Fan, G., Filipczak, L., & Chow, E. (2007). Symptom clusters in cancer patients: a review of the literature. Current Oncology, 14(5), 173–179. 15. Miaskowski, C., Dodd, M., Lee, K. (2004). Symptom clusters: The new frontier in symptom management research. Journal of the National Cancer Institute. Monographs (32), 17–21. 16. Gift, A. G., Stommel, M., Jablonski, A., & Given, W. (2003). A cluster of symptoms over time in patients with lung cancer. Nursing Research, 52(6), 393–400.