Measure of Injury Severity on the Basis of ICD-9-CM .... open with cerebral laceration or contusion, with moderate (1â24 .... Crush injury above the knee.
The Journal of TRAUMA威 Injury, Infection, and Critical Care
Harborview Assessment for Risk of Mortality: An Improved Measure of Injury Severity on the Basis of ICD-9-CM T. Al West, MD, MPH, Frederick P. Rivara, MD, MPH, Peter Cummings, MD, MPH, Gregory J. Jurkovich, MD, and Ronald V. Maier, MD Background: There have been several attempts to develop a scoring system that can accurately reflect the severity of a trauma patient’s injuries, particularly with respect to the effect of the injury on survival. Current methodologies require unreliable physiologic data for the assignment of a survival probability and fail to account for the potential synergism of different injury combinations. The purpose of this study was to develop a scoring system to better estimate probability of mortality on the basis of information that is readily available from the hospital discharge sheet and does not rely on physiologic data. Methods: Records from the trauma registry from an urban Level I trauma center were analyzed using logistic regression. Included in the regression were Internation Classification of Diseases-9th Rev (ICD-9-
CM) codes for anatomic injury, mechanism, intent, and preexisting medical conditions, as well as age. Two-way interaction terms for several combinations of injuries were also included in the regression model. The resulting Harborview Assessment for Risk of Mortality (HARM) score was then applied to an independent test data set and compared with Trauma and Injury Severity Score (TRISS) probability of survival and ICD-9-CM Injury Severity Score (ICISS) for ability to predict mortality using the area under the receiver operator characteristic curve. Results: The HARM score was based on analysis of 16,042 records (design set). When applied to an independent validation set of 15,957 records, the area under the receiver operator characteristic curve (AUC) for HARM was 0.9592. This represented significantly better discrimination
than both TRISS probability of survival (AUC ⴝ 0.9473, p ⴝ 0.005) and ICISS (AUC ⴝ 0.9402, p ⴝ 0.001). HARM also had a better calibration (Hosmer-Lemeshow statistic [HL] ⴝ 19.74) than TRISS (HL ⴝ 55.71) and ICISS (HL ⴝ 709.19). Physiologic data were incomplete for 6,124 records (38%) of the validation set; TRISS could not be calculated at all for these records. Conclusion: The HARM score is an effective tool for predicting probability of in-hospital mortality for trauma patients. It outperforms both the TRISS and ICD9-CM Injury Severity Score (ICISS) methodologies with respect to both discrimination and calibration, using information that is readily available from hospital discharge coding, and without requiring emergency department physiologic data. J Trauma. 2000;49:530 –541.
T
he quantitative characterization of injury severity has gained increasing attention over the last 20 years. The uniform application of accurate severity scales to the trauma patient population is essential for several reasons. First, comparison of therapeutic modalities and evaluations of patient outcomes can be meaningful only if objective measures of their injury severity can be assigned. Effective use of prehospital and interhospital triage depends on reliable estimates of the severity of a patient’s injuries. Quality improvement efforts and prevention programs rely on such objective data to make policy decisions and set priorities. Similarly, trauma research requires a tool by which to assure that study groups are compared with counterparts with equivalent degrees of injury.
Submitted for publication October 4, 1999. Accepted for publication May 11, 2000. Copyright © 2000 by Lippincott Williams & Wilkins, Inc. From the Department of Surgery, University of Texas Southwestern Medical Center, Dallas, Texas (T.A.W.) and Department of Surgery, University of Washington (R.V.M., G.J.J.) and Harborview Injury Prevention and Research Center (F.P.R., P.C.), Seattle, Washington. Address for reprints: T. Al West, MD, MPH, Department of Surgery, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75235-9158.
530
The modern era of injury severity scoring began in the early 1950s with the pioneering work of DeHaven.1 Since then, there have been numerous attempts to develop a scoring system that can accurately reflect the severity of a patient’s injuries, particularly with respect to the effect of those injuries on survival.2–9 However, these scoring systems have attributes that limit their ability to reliably predict mortality. First, there is often a loss of information. Physiologic and anatomic data from the patient are combined into intermediate scores, which are then combined (often in several complex steps) to achieve a final probability of survival score. Second, injury combinations are modeled as if the effects of the injuries are independent, although it is well recognized clinically that some combinations may be more lethal than a simple additive or multiplicative model would predict. Third, there are no current injury scoring systems that take preexisting disease, a widely recognized contributor to outcome,10,11 into account when assigning probability of survival. Most scoring systems rely on injury descriptions and physiologic data that are not easily and economically obtainable from standard discharge data. Last, prehospital or emergency department physiologic data is often missing, making calculation of probability of survival impossible. September 2000
HARM: A New Probability of Survival Score The Committee on Injury Prevention and Control at the Institute of Medicine has recommended validation of existing models, as well as development of new models, using common databases and standard measures of scale performance.12 We propose an alternative approach to injury severity scoring in an effort to rectify several drawbacks of the existing injury coding schemes. This injury severity classification method attempts to explicitly address the possibility that certain injury combinations might contribute to mortality beyond their independent effects. In addition, it takes comorbid disease into account when predicting mortality. Finally, this method uses data that are readily available for all patients, without relying on missing or inaccurate physiologic data.
METHODS Data were obtained from the Harborview Medical Center Trauma Registry. This computerized database contains information concerning all patients evaluated for traumatic injury at Harborview Medical Center who were either admitted to the hospital or died in the emergency department. The registry is used for quality improvement and clinical research and provides data to the Washington State Trauma Registry. Harborview Medical Center is a Level I trauma center and is a tertiary referral center for trauma, including burns and pediatric injuries, for the states of Washington, Alaska, Montana, and Idaho. Information was obtained for all trauma admissions and emergency room deaths between July 1, 1985, and December 31, 1997. No treatment-related variables were included in the analysis; only those variables that were determined upon or before the individual’s arrival at the hospital were used in initial model development. The data set was divided into two groups for the purposes of developing and testing the model. A 50% random sample from the entire set was selected as the design set; the remaining records were analyzed separately as the validation set. The design set was used to determine the best prediction model, and the validation set was used to test the accuracy of the model on an independent set of data. Statistical analysis was performed using the program Stata (College Station, TX). We first reclassified the 2,034 different Internation Classification of Diseases-9th Rev (ICD-9-CM)13 diagnosis codes representing injuries (codes 800 –959.9) into 109 injury categories. Each category represented a group of anatomically similar injuries. Every effort was made to balance the need for fine enough separation of different injury types (more categories) with the need for simplicity in modeling (fewer categories). In this classification scheme, many distinct ICD9-CM codes were collapsed into a broader category. However, there were many individual ICD-9-CM codes that contributed information to several different injury categories (Fig. 1). ICD-9-CM codes that corresponded to Abbreviated Injury Scale (AIS)1 severity scores less than 3 were excluded, Volume 49 • Number 3
Fig. 1. This single ICD-9-CM code for fracture of skull base, open with cerebral laceration or contusion, with moderate (1–24 hours) loss of consciousness and return to preexisting conscious level, would contribute to four injury categories: skull fracture, open intracranial wound, cerebral laceration or contusion, and moderate-length loss of consciousness.
parallel to the approach taken by the authors of the ASCOT model.8 These codes represent minor injuries, such as superficial wounds and minor orthopedic injuries, that have been found not to contribute significantly to mortality predictions.8 Burns and burn-related injuries were also excluded from the reclassification scheme, as the predictive model was not intended to estimate severity of burns nor predict burn mortality. A list of the injury categories and their corresponding ICD-9-CM codes can be found in Table 1. We also sought to include comorbid conditions in the model as well as injuries. ICD-9-CM codes were combined into 11 categories of comorbid disease on the basis of the earlier work of Morris and MacKenzie10 (Table 1). The mechanism of injury and its associated intent were classified on the basis of the primary External Cause of Injury code, defined within ICD-9-CM. These codes were classified according to recommendations from the Centers for Disease Control and Prevention.14 Sixteen specific mechanisms and five classes of intent were represented, as defined in Table 1. A panel of experienced trauma surgeons produced a list of 25 two-way interactions among the injury categories defined above that were judged to be clinically significant. To maintain a balance between specificity and simplicity, interactions were created from categories that were broader than 531
The Journal of TRAUMA威 Injury, Infection, and Critical Care
Table 1 Anatomic Diagnoses, Comorbid Conditions, Mechanism Categories, and Intent Categories Considered for the Model Anatomic Diagnoses Considered for the Model Injury
Skull fracture Open intracranial wound Facial fracture Concussion Cerebral laceration or contusion Cerebellar or brainstem hemorrhage Subarachnoid hemorrhage Epidural hemorrhage Subdural hemorrhage Unspecified intracranial hemorrhage Unspecified intracranial injury Loss of consciousness for ⬍24 hours Loss of consciousness for ⬎24 hours (reversible) Loss of consciousness for ⬎24 hours (irreversible) Unspecified period of loss of consciousness Complete spinal cord injury C4 or above Incomplete spinal cord injury C4 or above Unspecified spinal cord injury C4 or above Complete spinal cord injury C5–T12 Incomplete spinal cord injury C5–T12 Unspecified spinal cord injury C5–T12 Lumbar/sacral/cauda equina cord injury Unspecified spinal cord injury Fracture of larynx or trachea Uncomplicated open wound of larynx Complicated open wound of larynx Uncomplicated open wound of pharynx Complicated open wound of pharynx Common or internal carotid artery injury External/unspecified carotid artery injury Internal jugular vein injury Other/unspecified neck vessel injury Crush injury to neck Flail chest Fracture of 1–3 ribs Fracture of ⬎3 ribs Pneumothorax Hemothorax Hemopneumothorax Sternal fracture Cardiac contusion Laceration of heart Full-thickness cardiac laceration Unspecified cardiac injury Pulmonary contusion Pulmonary laceration Unspecified lung injury Diaphragm injury Bronchus injury Esophagus injury Other/unspecified intrathoracic injury Thoracic aorta or great vessels SVC or innominate vein Pulmonary vessels Unspecified thoracic vessels Open thoracic wound Stomach injury Duodenal injury Small intestine injury
ICD-9-CM Codes Included
800–801, 803–804 800–801(.4–.9), 803–804(.4–.9), 851–854(.1, .3, .5, .7, .9) 802 801–801(.9), 803–804(.9), 850, 851–854(.9) 800–801(.1, .6), 803–804(.1, .6), 851–854(.0–.3, .8) 851–854(.4–.7, .9) 852(.0, .1) 852(.4, .5) 852(.2, .3) 801–802(.2, .3, .7, .8), 803–804(.2, .3, .7, .8) 801–802(.4, .9), 803–804(.4, .9), 854 801–802(.x2, .x3), 803–804(.x2, .x3), 850(.x1, .x2), 851–854(.x2, .x3) 801–802(.x4), 803–804(x4), 850(.x3), 851–854(.x4) 801–802(.x5), 803–804(x5), 850(.x4), 851–854(.x5) 801–802(.x6), 803–804(x6), 850(.x5), 851–854(.x6) 806(.01, .11), 952.01 806(.02–.04, .12–.14), 952(.02–.04) 806(.00, .10), 952.00 806(.06, .16, .21, .26, .31, .36), 952(.06, .11, .16) 806(.07–.09, .17–.19, .22–.24, .27–.29, .32–.34, .37–.39), 952(.07–.09, .12–.14, .17–.19) 806(.05, .15, .20, .25, .30, .35), 952(.05, .10, .15) 806(.4–.7), 952(.2–.4) 806(.8–.9), 952(.8–.9) 807(.5–.6) 874(.00–.02) 874(.10–.12) 874.4 874.5 900(.01, .03) 900(.00, .02) 900.1 900(.8–.9) 925.2 807.4 807(.00–.03, .10–.13) 807(.04–.09, .14–.19) 860(.0–.1) 860(.2–.3) 860(.4–.5) 807(.2–.3) 861(.01, .11) 861(.02, .12) 861(.03, .13) 861(.00, .10) 861(.21, .31) 861(.22, .32) 861(.20, .30) 862(.0–.1) 862(.21, .31) 862(.22, .32) 862(.29, .39, .8–.9) 901(.0–.1) 901(.2–.3) 901(.40–.42) 901(.81–.89) 860–862(.1x, .3x, .5x, .7x, .9x) 863(.0–.1) 863(.21, .31) 863(.20, .29, .30, .39)
SVC, superior vena cava; GI, gastrointestinal; IVC, inferior vena cava; E-codes, external cause of injury codes; NEC, not elsewhere classifiable.
532
September 2000
HARM: A New Probability of Survival Score
Table 1 continued Anatomic Diagnoses Considered for the Model Injury
Colon injury Rectal injury Unspecified GI tract injury Injury to head of pancreas Injury to body of pancreas Injury to tail of pancreas Multiple/unspecified pancreatic injury Hematoma/contusion/minor hepatic laceration Moderate laceration of liver Major laceration of liver Other/unspecified liver injury Hematoma or capsular tear of spleen Moderate laceration of spleen Major laceration of spleen Other/unspecified splenic injury Minor kidney injury Major kidney injury Other/unspecified renal injury Bladder injury Ureter injury Uterus injury Other/unspecified abdominal organ injury Open abdominal wound Popliteal artery injury Traumatic amputation above the knee Bilateral lower extremity amputation Crush injury above the knee Femoral artery injury Open pelvic fracture Closed pelvic fracture Hip fracture Abdominal aortic injury IVC injury Hepatic vein injury Celiac/mesenteric artery injury Portal vein injury Mesenteric venous injury Splenic vein injury Iliac vessel injury Renal vessel injury Other/unspecified abdominal vessel injury
ICD-9-CM Codes Included
863(.40–.44, .46, .49, .50–.54, .56, .59) 863(.45, .55) 863(.80, .89, .90, .99) 863(.81, .91) 863(.82, .92) 863(.83, .93) 863(.84, .94) 864(.01–.02, .11–.12) 864(.03, .13) 864(.04, .14) 864(.00, .09–.10, .19) 865(.01–.02, .11–.12) 865(.03, .13) 865(.04, .14) 865(.00, .09–.10, .19) 866(.01–.02, .11–.12) 866(.03, .13) 866(.00, .10) 867(.1–.2) 867(.2–.3) 867(.4–.5) 867(.6–.9), 868, 869 863(.1x, .3x, .5x, .9x), 864–866(.1x), 867(.1x, .3x, .5x, .7x, .9x), 868–869(.1x) 904.41 897(.2–.3) 897(.6–.7) 928(.00–.01, .8) 904(.0–.1) 808(.1, .3, .5x, .9), 839.52 808(.0, .2, .4x, .8), 839.42 820 902.0 902(.11, .19) 902.11 902(.2x) 902.33 902(.31–.32) 902.34 902(.50–.54, .59) 902(.4x) 902(.55–.56, .8x, .9)
Comorbid Conditions Considered for the Model Comorbid Condition
Chronic obstructive pulmonary disease Congenital coagulopathy Diabetes mellitus Cirrhosis Ischemic heart disease Hypertension Psychoses Epilepsy Obesity Alcohol or drug dependence Neurologic degenerative disease
ICD-9-CM Diagnosis Codes
490–496 286–287(excluding 286.6, 287.4) 250 571 410–414 401–405 290–298 345 278 303–304 330–337
Mechanism Categories Mechanism
Cut/pierce Fall Firearm
Volume 49 • Number 3
ICD-9-CM E-Codes
920, 956, 966, 986, 974 880–886, 888, 957, 968.1, 987 922, 955(.0–.4), 985(.0–.4), 970
533
The Journal of TRAUMA威 Injury, Infection, and Critical Care
Table 1 continued Mechanism Categories Mechanism
ICD-9-CM E-Codes
Machinery Motor vehicle collision—occupant Motorcycle collision Motor vehicle/bicycle collision Motor vehicle/pedestrian collision Motor vehicle collision—unspecified Bicycle collision Pedestrian Other transport
919 810–819(.0–.1) 810–819(.2–.3), 820–825(.2–.3) 819–819(.6) 810–819(.7) 810–819(.9) 800–807(.3), 820–825(.6), 826(.1, .9) 800–807(.2), 820–825(.7), 826–829(.0) 800–807(.0–.1, .8–.9), 820–825(.0–.5, .8–.9), 826(.2–.8), 827–829(.2–.9), 83, 833–845, 958.6, 988.6 916–917, 960.0, 968.2, 973, 975 846–848, 914–915, 918, 921, 923, 925–926, 929, 955(.5, .9), 958(.0, .4), 910.1, 965(.5–.9), 967, 968.4, 985.5, 988(.0, .4), 971, 978, 990–994, 996, 997(.0–.2) 928.8, 929.8, 958.8, 959, 968.8, 969, 988.8, 989, 977, 995, 997.8, 998–999 887, 928.9, 929.9, 958.9, 968.9, 988.9, 976, 977.9
Struck by, against Other specified, classifiable Other specified, NEC Unspecified Intent Categories Intent
ICD-9-CM E-Codes
Unintentional Self-inflicted Assault Undetermined Legal intervention
800–869, 880–929 950–959 960–969 980–989 970–978, 990–999
any main effects categories in the model. A detailed description of the interaction terms is found in Table 2. Forward stepwise logistic regression analysis was carried out on the design set, with in-hospital death as the dependent variable. Each of the 109 injury categories was included as a binary variable (representing presence or absence of injury in that category), as were terms for each of the mechanism and intent categories. Two-way interaction terms, as noted above, were also included in the stepwise regression. Age was modeled as a quadratic spline,15 with knots at ages 12, 25, and 65 years. Terms were included in the final model if the Wald statistic reached a significance of p ⫽ 0.05. All main effects that contributed to a significant interaction term were included in the model. The prediction model determined from the design set was then applied to the validation set. The coefficients for the predictor variables were used to calculate the Harborview Assessment for Risk of Mortality (HARM) score. This score takes the form of a probability, between 0 and 1, of inhospital mortality:
Pm⫽1/(1⫹e⫺).  ⫽ 0 ⫹ 1(age) ⫹ 2(age2) ⫹ . . . ⫹ 80(chest ⫻ spinal cord), where 0 is a constant and n is the regression coefficient associated with each term in Table 3. The presence or absence of each of the mechanisms and injury categories is represented by 1 or 0 in this equation. 534
Abbreviated Injury Scale severities were derived from the ICD-9-CM data for each of the patients using the ICDMAP program.16 These AIS scores were used to calculate an ISS and then combined with age and Revised Trauma Score data using published coefficients.17 An ICISS score9 was also derived for each validation set patient, using a dictionary of the survival risk ratios derived from the North Carolina Hospital Discharge database. All injury ICD-9-CM codes between 800 and 959.9 were used for this calculation. To compare the ability of HARM, ICISS, and Trauma and Injury Severity Score (TRISS) to discriminate between survivors and nonsurvivors, a receiver operator characteristic (ROC) curve18 was derived for each scoring system on the basis of the validation set. The area under the curve (AUC) was calculated and compared using a nonparametric method.19 The calibration characteristics of the three scores were compared using the Hosmer-Lemeshow (HL) statistic. Sensitivity and specificity were calculated for each score as well.
RESULTS A total of 33,990 admissions for blunt or penetrating trauma were recorded in the Harborview Medical Center Trauma Registry between July 1, 1985, and December 31, 1997. To avoid analyzing readmissions associated with the same injury for a given patient, only the first admission for any particular patient was included, resulting in 1,683 excluded stays. The final data set thus consisted of 32,307 admissions or emergency department deaths; 16,185 admisSeptember 2000
HARM: A New Probability of Survival Score
Table 2 Definitions of Interaction Categories and Interaction Terms Considered for the Model Definitions of Interaction Categories Broad Injury Category Included Injury Categories
Head
Neck
Spinal cord
Chest wall/pleura
Thoracic vascular
Lung
Heart
Liver
Spleen
Pancreas
Skull fracture Open intracranial wound Facial fracture Concussion Cerebral laceration or contusion Cerebellar/brainstem laceration Subarachnoid hemorrhage Epidural hemorrhage Subdural hemorrhage Unspecified intracranial hemorrhage Unspecified intracranial injury Fracture of larynx or trachea Uncomplicated open wound of larynx Uncomplicated open wound of larynx Complicated open wound of pharynx Uncomplicated open wound of pharynx Common/internal carotid artery injury External/unspecified carotid artery injury Internal jugular vein injury Other/unspecified vessel in neck Complete cord injury C4 or above Incomplete cord injury C4 or above Unspecified cord injury C4 or above Complete cord injury C5–T12 Incomplete cord injury C5–T12 Unspecified cord injury C5–T12 Lumbar/sacral/cauda equina injury Unspecified spinal cord injury Flail chest Fracture of 1–3 ribs Fracture of ⬎3 ribs Pneumothorax Hemothorax Hemopneumothorax Sternal fracture Thoracic aorta or great vessel injury SVC or innominate vein injury Pulmonary vessel injury Unspecified thoracic vessel injury Pulmonary contusion Pulmonary laceration Unspecified lung injury Cardiac laceration Full-thickness cardiac laceration Cardiac contusion Unspecified cardiac injury Minor liver laceration/contusion/hematoma Moderate liver laceration Major liver laceration Other/unspecified liver injury Hepatic vein injury Hematoma or capsular tear of spleen Moderate laceration of spleen Major laceration of spleen Injury to head of pancreas Injury to body of pancreas Injury to tail of pancreas Multiple/unspecified pancreatic injuries
SVC, superior vena cava; COPD, chronic obstructive pulmonary disease.
Volume 49 • Number 3
Table 2 continued Definitions of Interaction Categories Broad Injury Category Included Injury Categories
Abdominal vascular Abdominal aortic injury Inferior vena cava injury Celiac/mesenteric artery injury Portal vein injury Mesenteric venous injury Splenic vein injury Iliac vessel injury Renal vessel injury Other/unspecified abdominal vascular injury Kidney Minor kidney injury Major kidney injury Other/unspecified kidney injury Pelvis Open pelvic fracture Closed pelvic fracture Interaction Terms Considered for the Model
Head—chest wall Head—thoracic vascular Head—liver Head—pelvis Head—spinal cord Head—age ⬎65 Chest wall—spinal cord Chest wall—liver Chest wall—pelvis Chest wall—COPD Lung—COPD Heart—ischemic heart disease Liver—colon Liver—cirrhosis Liver—abdominal vascular Colon—abdominal vascular Colon—pancreas Colon—spleen Pancreas—abdominal vascular Pancreas—small intestine Pancreas—kidney Kidney—abdominal vascular Hip fracture—age ⬎65 Pelvis—age ⬎65 Pelvis—rectum
Table 3 Characteristics of study population Characteristic
Design Set (n ⫽ 16,185)
Validation Set (n ⫽ 16,122)
p Value
Mean age (y) Male gender (%) Blunt injury (%) Mean ISS Mortality (%)
34 73.8% 81.6% 11.9 6.7%
34.3 74.4% 81.6% 11.9 6.9%
0.1 0.2 0.3 0.8 0.3
sions were in the design set, and 16,122 were in the validation set. The study population was predominantly young and male, and most injuries were the result of blunt trauma (Table 3). The mean ISS was 10.9 (median 9). There were no significant differences between the design and validation sets 535
The Journal of TRAUMA威 Injury, Infection, and Critical Care
Table 4 Variables Included in Final Model Independent Variable
Constant Age-related variables Age Age2 Quadratic spline term, age 12–25 Quadratic spline term, age 25–65 Quadratic spline term, age ⬎65 Mechanism of injury Fall Firearm Machinery Motor vehicle collision, unspecified Motor vehicle collision, occupant Motor vehicle/motorcycle collision Motor vehicle/bicycle collision Motor vehicle/pedestrian collision Bicycle collision Pedestrian Other transport Struck by, against Other specified, classifiable Other specified, not elsewhere classified Unspecified Injury categories Loss of consciousness ⬎24 hours (irreversible) Loss of consciousness ⬍24 hours Skull fracture Open intracranial wound Facial fracture Concussion Cerebral laceration or contusion Cerebellar or brainstem hemorrhage Subarachnoid hemorrhage Epidural hemorrhage Subdural hemorrhage Unspecified intracranial hemorrhage Unspecified intracranial injury Complete spinal cord injury C4 or above Incomplete spinal cord injury C4 or above Unspecified spinal cord injury C4 or above Complete spinal cord injury C5–T12 Incomplete spinal cord injury C5–T12 Unspecified spinal cord injury C5–T12 Lumbar/sacral/cauda equina cord injury Unspecified spinal cord injury Complicated open wound of pharynx Other/unspecified neck vessel injury Flail chest Fracture of 1–3 ribs Fracture of ⬎3 ribs Pneumothorax Hemothorax Hemopneumothorax Cardiac contusion Cardiac laceration Full-thickness cardiac laceration Unspecified cardiac injury Pulmonary contusion Pulmonary laceration Thoracic aorta or great vessels Superior vena cava or innominate vein
536
Adjusted Odds Ratio
Regression Coefficient
⫺4.708587 0.81 1.01 0.99 1.00 1.00
⫺0.2163938 0.0109741 ⫺0.0122462 0.0019716 ⫺0.0008945
1.30 2.92 0.79 3.15 1.61 0.77 1.48 1.79 0.50 2.68 2.12 1.14 4.45 0.34 1.93
0.2621639 1.072484 ⫺0.2337287 1.148873 0.4736818 ⫺0.2593759 0.3913342 0.5829926 ⫺0.6982064 0.9867274 0.7495314 0.5390938 1.493163 ⫺1.087112 0.6551576
95.17 0.30 1.84 2.31 0.6485 0.5684 0.7112 6.10 1.50 0.86 2.43 1.42 1.35 30.93 6.55 1.79 0.30 1.18 1.24 0.12 0.01 7.50 3.92 0.78 0.58 1.63 1.14 2.16 1.16 3.16 22.36 67.20 32.03 1.72 27.31 13.48 28.44
4.555617 ⫺1.193073 0.6120652 0.8373468 ⫺0.4331615 ⫺0.5648866 ⫺0.3408512 1.807632 0.4054637 ⫺0.1501403 0.8876912 0.3487128 0.3017085 3.431704 1.879599 0.5846756 ⫺1.203703 0.1672258 0.2111382 ⫺2.156565 ⫺4.439985 2.015116 1.366395 ⫺0.2475337 ⫺0.5528944 0.4899484 0.1305839 0.771174 0.1452939 1.15051 3.107243 4.207728 3.466697 0.5399678 3.307087 2.600922 3.34766
September 2000
HARM: A New Probability of Survival Score
Table 4 continued Independent Variable
Colon injury Injury to tail of pancreas Moderate laceration of liver Major laceration of liver Hepatic vein injury Major laceration of spleen Minor kidney injury Other/unspecified abdominal organ Traumatic amputation above the knee Open pelvic fracture Closed pelvic fracture Abdominal aortic injury Inferior vena cava Mesenteric venous injury Comorbidities Congenital coagulopathy Cirrhosis Ischemic heart disease Hypertension Psychoses Alcohol or drug dependence Interaction terms Head ⫻ spinal cord Pelvis ⫻ (age ⬎65) Chest ⫻ spinal cord
with regard to age, gender, mortality, mean ISS, or proportion of injuries attributable to blunt trauma. The final logistic regression model contained 80 terms (Table 4). Of the 100 total injury categories considered, 51 were included in the model. Sixteen were not significant individually but were included as main effects terms for the modeled interactions. Six of the 11 terms representing comorbid conditions were included in the final model. All 16 mechanism categories were represented as 15 dummy variables. Three interaction terms were included. Finally, modeling age as a quadratic spline with three knots required five age-related variables. When applied to the validation set, the HARM model was able to calculate a probability of mortality in 16,097 of the 16,122 admissions (99.9%). ICISS was successful in 15,820 cases (98.1%). Because of missing physiologic data, a TRISS probability of survival score could be calculated for 9,923 (61.4%) of the patients in the validation set. HARM had the highest AUC at 0.958 (Fig. 2, Table 5). For ICISS, the AUC was 0.940. TRISS was in between, with an AUC of 0.946. HARM had a better fit to the validation data (HL statistic 21.37, p ⫽ 0.0315) than ICISS (HL 712.4, p ⫽ 0.0005) or TRISS (HL 59.54, p ⫽ ⬍ 0.005). Note that a smaller HL statistic connotes a better fit to the actual data. Because the TRISS score could be calculated only for 61.4% of the total validation set, ROC curves and HL statistics were derived for each model in the subset of records for which all three scores could be calculated. Nonparametric derivation of the AUC allows for a calculation of standard Volume 49 • Number 3
Adjusted Odds Ratio
Regression Coefficient
1.73 0.07 2.86 14.63 6.78 6.01 0.32 3.25 21.40 2.48 0.98 4.04 5.82 9.49
0.5462001 ⫺2.598521 1.050193 2.682833 1.913794 1.793359 ⫺1.136782 1.180044 3.06309 0.9092737 ⫺0.0239407 1.395601 1.761481 2.250205
4.46 19.20 2.68 0.58 0.16 0.46
1.494934 2.954898 0.9844608 ⫺0.546734 ⫺1.854641 ⫺0.7681033
2.12 2.13 5.01
0.7507725 0.7570994 1.611111
Fig. 2. Comparison of ROC curves for HARM, TRISS, and ICISS.
error, thus allowing relevant comparisons to be made.19 The HARM score maintains its larger AUC in this subset, and the difference is statistically significant (Table 6). To calculate the sensitivity and specificity of each score for predicting survival, an arbitrary cutoff was chosen for each score so that sensitivity was fixed at 95%. In other words, 95% of survivors had a predicted probability of survival above this cutoff value. Specificity was then determined 537
The Journal of TRAUMA威 Injury, Infection, and Critical Care
Table 5 Predictive Power for Survival Using HARM, TRISS, and ICISS Na
Area under ROC
HL Statistic
p Value for HL Statistic
16,097 9,923 15,820
0.958 0.946 0.939
19.74 55.71 709.19
0.0318 ⬍0.0005 ⬍0.0005
Model
HARM TRISS ICISS a
Number of patients for which score is calculable.
Table 6 Comparison of Area under ROC Curves for HARM, TRISS, and ICISS on the Basis of Identical Validation Sets (n ⴝ 9923) Model
HARM TRISS ICISS
Area under ROC
p Value for Comparison with HARM
0.958 0.947 0.940
— 0.035 ⬍0.0005
Table 7 Comparison of Specificities for HARM, TRISS, and ICISS Model
HARM
TRISS
ICISS
Sensitivity
Cutoff Probability
Specificity
85% 90% 95%
0.973 0.953 0.885
92.4% 90.3% 83.4%
85% 90% 95% 85%
0.961 0.913 0.772 0.833
91.1% 83.7% 72.1% 89.3%
90% 95%
0.775 0.670
85.6% 78.2%
Table 8 Ten Most Lethal Injuries in HARM Model Independent Variable
Adjusted Odds Ratio
Loss of consciousness for ⬎24 hours (irreversible) Full-thickness cardiac laceration Unspecified cardiac injury Complete spinal cord injury C4 or above Superior vena cava or innominate vein Pulmonary laceration Cardiac contusion Traumatic amputation above the knee Major laceration of liver Thoracic aorta or great vessels
95.2 67.2 32.0 30.9 28.4 27.3 22.4 21.4 14.6 13.5
at that cutoff point for each score and compared (Table 7). The specificity for HARM was 83.4%, for ICISS was 78.20%, and for TRISS was 72.1%.
DISCUSSION Although TRISS has been the de facto standard for the prediction of mortality among trauma patients for many years, it has several limitations.4,8 –9,20 –21,26 –28 In addition to 538
theoretical issues, perhaps the most important limitation is in its real-life applicability to patients with missing physiologic data. The goal of this study was to develop a scoring system that can predict mortality by using patient data that are reliably available by extracting as much information as possible out of that data. The resulting HARM score possesses excellent power in discriminating between survivors and nonsurvivors. In addition, it has better calibration than either TRISS or ICISS. The accurate assignment of AIS and ISS scores to patients requires the time and effort of an individual who understands the procedures for deriving the scores, be that a nurse coordinator or a trauma registrar. Many trauma centers, in an effort to obviate that need, rely on the ICDMAP program16 to convert ICD-9-CM injury diagnosis codes into AIS scores for use in calculation of ISS and TRISS. However, there is not a one-to-one correspondence for most of the ICD-9-CM diagnoses, and many injuries are inaccurately scored by this method. There have been promising results from several studies assessing the ability to predict survival on the direct basis of ICD-9-CM codes.9,23–26 A recent study concludes that an ICD-9-CM– based score is equally accurate whether on the basis of assignment by a dedicated trauma registrar or discharge data from the standard hospital information system.22 The HARM score coefficients are based on logistic regression analysis of more than 16,000 trauma patients at a Level I trauma center. The basis for the score is a modification of the ICD-9-CM coding scheme, which has been shown in several studies to be useful in stratifying trauma patients. Logistic regression was chosen for the statistical method in this study because of its ability to model the interactions between independent variables and because it does not require the assumption of independence among variables. Variables were selected for inclusion in the final model on the basis of their ability to add predictive power to the final score. The adjusted odds ratios for the 10 most significant injury categories are listed in Table 4. Examination of these top 10 odds ratios reveals a familiar pattern to those who are involved in clinical trauma care. Prolonged coma is overwhelmingly the most important variable in the model, underscoring the devastating effects of major head injury on survival. Cardiac injuries, both specific ones, such as a fullthickness laceration, and unspecified ones, were also important in the model. Major liver injury, pulmonary laceration, and various abdominal vascular injuries are also on the list. September 2000
HARM: A New Probability of Survival Score Several comorbid factors were also found to be important, most notably, cirrhosis, congenital coagulopathy, and ischemic heart disease (IHD). The most important determinants of survival in this model seem to reflect clinical experience. Although many different interactions were explored, there were only three that were significant enough for inclusion in the final model. Pelvic fracture was associated with a higher mortality in older patients (⬎65 years) than in younger patients after adjustment for the presence of all other injuries. Spinal cord injuries had a positive interaction with both head and chest injuries. It is unclear why the interaction factors did not play more of a role in the model. It may have been because of the overwhelming influence of coma and other injury categories. Another explanation may be that the broader categories defined for the interactions led to a watering-down effect as more severe injuries were included in the same category with less severe injuries. Different modeling methodologies, and perhaps larger design data sets, may make it easier to capture the clinically recognized importance of synergism between injuries. HARM assigns a probability of mortality without the use of physiologic data. The underlying hypothesis is that given enough information about patients’ physiologic reserves, mechanisms of injury, and anatomic injury patterns, their responses to an insult can be predicted. One potential weakness of HARM, therefore, is that two patients with the same injuries, mechanism, comorbidities, and age would have the same score regardless of their vital signs in the emergency room. However, the point of HARM is to predict survival on the basis of factors established at the time of the injury itself. This approach avoids the inherent problems with physiologic data: there is often no way to know the time elapsed from the injury until the vital signs were taken, and inconsistent use of prehospital versus emergency room data will yield different physiologic scores. TRISS fails to assign a probability score to a significant percentage of patients. In this study, the ROC curves were recalculated for HARM and ICISS on the subset of patients for which TRISS did assign a score. The AUCs remain unchanged for each score for this subset, and it seems that HARM clearly outperforms the other two scores. HARM achieved this high level of calibration without the use of physiologic data and predicted a score in nearly every patient in the data set. In using any probability score in an applied setting, such as for quality assurance, it is important to use the proper cut point as a divider between predicted survivors and nonsurvivors. The sensitivity and specificity for any particular score vary depending on what cutoff is used. Sensitivity will increase only at the expense of specificity and vice versa. Therefore, to compare the sensitivity and specificity of any group of scores, one of the two values must be fixed. For the purposes of this comparison, the sensitivity was fixed at 95%, and the specificity at that point was compared. It can be seen from Table 7 that the price (in misclassification of nonsurviVolume 49 • Number 3
vors) paid by HARM for its high sensitivity is less than that for both TRISS and ICISS. The split sample technique used in this study addresses the issue of validity by applying the HARM model to a second set of unique data and determining how well the model predicts for a population different from that for which it was designed. HARM was found to work well in this second sample. However, this does not assure the ability to generalize the results of this study to other settings. The Harborview Medical Center patient population may be homogeneous in comparison with other hospital populations. In addition, because the basis of the HARM is in ICD-9-CM codes, its ability to predict mortality in the Harborview data set may be related to local consistencies in coding; other hospitals with slightly different coding conventions may not fare as well with the present HARM coefficients. Application of the HARM model to a data set from a different hospital with different coding conventions may not achieve the same results. True verification of the model will require testing it with a completely independent data set, preferably one with a heterogeneous collection of data sources. ICD-10-CM coding will soon be adopted for use in coding hospitals discharges. If the HARM method is validated at other institutions, this suggests that it will be worthwhile creating a new set of coefficients on the basis of the ICD-10-CM codes, using a method similar to that used to develop the HARM score.
CONCLUSION This study confirms earlier work that indicates that injury severity scores based on ICD-9-CM codes can predict mortality with as much or more accuracy than those based on the AIS rubric with considerably less effort and expense. Furthermore, it suggests that this predictive power can be gained without the use of physiologic data from the emergency department. The HARM score, as outlined here, is a predictive score that can accurately predict outcome on the basis of data that are derived entirely from factors that are established before or at the time of an injury and easily abstracted from standard hospital discharge data. Perhaps more importantly, it can be used not only for patients for whom physiologic data are available but for any patient whose injuries are known.
REFERENCES 1.
DeHaven H. The Site, Frequency and Dangerousness of Injury Sustained by 800 Survivors of Light Plane Accidents. New York: Crash Injury Research, Cornell University Medical College; 1952. 2. Association for the Advancement of Automotive Medicine. The Abbreviated Injury Scale,1990 Revision. Des Plaines, IL: AAAM; 1990. 3. Teasdale G, Jennett B. Assessment of coma and impaired consciousness: a practical scale. Lancet. 1974;2:81– 84. 4. Baker SP, O’Neill B, Haddon W, et al. The injury severity score: a method for describing patients with multiple injuries and evaluating emergency care. J Trauma. 1974;14:187–196.
539
The Journal of TRAUMA威 Injury, Infection, and Critical Care 5. 6. 7.
8. 9. 10.
11.
12.
13.
14.
15. 16.
17. 18. 19.
20.
21.
22.
23.
24.
25.
26.
27.
Champion HR, Sacco WJ, Carnazzo AJ, et al. Trauma score. Crit Care Med. 1981;9:672– 676. Champion HR, Sacco WJ, Copes WS, et al. A revision of the Trauma Score. J Trauma. 1989;29:623– 629. Copes WS, Sacco WJ, Champion HR, et al. Progress in characterizing anatomic injury severity. Paper presented at the 33rd Annual Proceedings, Association for the Advancement of Automotive Medicine, 1989; Des Plaines, IL. Champion HR, Copes WS, Sacco WJ, et al. A new characterization of injury severity. J Trauma. 1990;30:539 –545. Osler T, Rutledge R, Deis J. ICISS: an International Classification of Disease-9 based injury severity score. J Trauma. 1996;41:380 –388. Morris JA, MacKenzie EJ, Edelstein SL. The effect of preexisting conditions on mortality in trauma patients. JAMA. 1990;263:1942– 1946. Milzman DP, Boulanger BR, Rodriguez A, et al. Pre-existing disease in trauma patients: a predictor of fate independent of age and injury severity score. J Trauma. 1992;32:236 –244. Bonnie RJ, Fulco CE, Liverman CT, eds. Reducing, the Burden of Injury: Advancing Prevention and Treatment. New York: National Academy Press; 1999. Commission of Professional Hospital Activities. International Classification of Disease, 9th Revision, Clinical Modification. Ann Arbor, MI: Edwards Brothers; 1977. Centers for Disease Control, and Prevention. Recommended framework for presenting injury mortality data. MMWR. 1997;46:1– 30. Greenland S. Dose-response and trend analysis in epidemiology: alternatives to categorical analysis. Epidemiology. 1995;6:356 –365. Mackenzie EJ, Steinwachs CM, Shankar B. Classifying trauma severity based on hospital discharge diagnoses: validation of an ICD9 CM to AIS-85 conversion table. Med Care. 1989;27:412. Champion HR, Sacco WJ, Copes WS. Injury severity scoring again [editorial]. J Trauma. 1995;38:94 –95. Metz CE. Basic principles of ROC analysis. Semin Nucl Med. 1978; 8:283–298. DeLong E, DeLong D, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837– 845. Markle J, Cayten CT, Burne DW, et al. Comparison between TRISS and ASCOT methods in controlling for injury severity. J Trauma. 1992;33:326. Hannan EL, Mendeloff J, Farrell LS, et al. Validation of TRISS and ASCOT using a non-MTOS trauma registry. J Trauma. 1995;38:83– 88. Osler T, Cohen M, Rogers FB, et al. Trauma registry injury coding is superfluous: a comparison of outcome prediction based on trauma registry International Classification of Diseases, 9th Revision (ICD9), and hospital information ICD-9 codes. J Trauma. 1997;43:253– 257. Rutledge R. Injury severity and probability of survival assessment in trauma patients using a predictive hierarchical network model derived from ICD-9 codes. J Trauma. 1995;38:590 – 601. Rutledge R, Fakhry S, Baker C, et al. Injury severity grading in trauma patients: a simplified technique base upon ICD-9 coding. J Trauma. 1993;35:497–507. Rutledge R, Hoyt D, Eastman AE, et al. Comparison of the injury severity score and ICD-9 diagnosis codes as predictors of outcome in injury: analysis of 44, 032 patients. J Trauma. 1997;42:477– 489. Rutledge R, Osler T, Emery S, et al. The end of the ISS and TRISS: ICISS outperforms both ISS and TRISS as predictors of trauma patient survival, hospital charges, and hospital length of stay. J Trauma. 1998;44:41– 49. Osler T. Injury severity scoring: perspectives in development and future directions. Am J Surg. 1993;165:43–51S.
540
28.
Champion HR, Copes WS, Sacco WJ, et al. Improved predictions from ASCOT over TRISS: results of an independent evaluation.
EDITORIAL COMMENT In this issue of the Journal, West et al. examine the Harborview Assessment for Risk of Mortality (HARM), a new predictor of trauma patient mortality based on ICD-9CM. The HARM methodology departs substantially from most earlier efforts to predict mortality for trauma patients in several ways. First, it does not use any physiologic measures. Second, it takes preexisting disease (comorbidities) into account. Third, it includes numerous (15) mechanisms of injury in its final logistic regression model rather than merely separating patients on the basis of blunt and penetrating injuries, or on the basis of more aggregated mechanisms such as MVCs, low falls, other blunt injuries, stab wounds, and gunshot wounds. Fourth, it reclassifies 2,034 ICD-9-CM injury diagnosis codes into 109 injury categories, 51 of which were used in its final statistical model. Fifth, it includes interactions of injury categories. The final HARM statistical model utilizes a total of 80 variables, with 5 representing age, 15 for mechanisms of injury, 51 for injury categories, 6 for comorbidities, and 3 interaction terms. The authors conclude that HARM outperforms Trauma and Injury Severity Score (TRISS) and International Classification of Diseases, Ninth Revision (ICD-9)-based Injury Severity Score (ICISS) with respect to both discrimination and calibration, the two most commonly used measures of fit for logistic regression models. The fact that HARM does not require emergency department physiologic data is claimed as an additional advantage. We believe that there are a few caveats that should be expressed regarding these findings. First, for TRISS, the authors used published coefficients derived from the MTOS study which is at least 10 years old. ICISS scores were derived using a dictionary of survival/risks ratios derived from the North Carolina Hospital discharge database. By validating HARM using a split sample method from one hospital and then comparing the results to TRISS and ICISS with coefficients derived from vastly different databases, the authors have biased the study in terms of favoring their new model. The truest test of whether this model outperforms TRISS and ICISS would be to derive TRISS coefficients and ICISS risk ratios from the Harborview data set, and then compare all three models using either the Harborview or an independent data set.1 In particular, the comparisons of calibration among HARM, TRISS, and ICISS are inappropriate when the underlying population mortality rates (in the initial MTOS [for TRISS], North Carolina [ICISS] and Harborview [HARM] are different.1 Regarding the specific statistical model that is presented by West et al., there are several factors/variables that do not have face validity in that they appear to be protective of mortality, yet clearly should not be. Examples are comorbidities such as alcohol or drug dependence, psychoses, and September 2000
HARM: A New Probability of Survival Score hypertension. For example, the interpretation of the contribution of psychoses to the model is that patients with psychoses have odds of dying in the hospital that are 0.16 times (roughly one sixth) the odds of patients without psychoses dying in the hospital, all other risk factors being the same. This clearly makes no sense and is perhaps just a function of a relatively small number of patients with psychoses in the study (or a relatively small number whose psychoses were coded). If this methodology is transported to other sites that do not have sufficient data to identify their own significant variables and associated coefficients, factors without face validity should be removed. One could also argue that these factors should not be used internally at Harborview. Also, there are some other investigations that would be of interest in exploring the relative advantages/disadvantages of HARM. For example, the contention that HARM outperforms methodologies that include physiologic measures would have been more convincing if the Revised Trauma Score (RTS) data elements had been added to ICISS as they were to the Injury Severity Score (ISS) to form TRISS. As indicated by Rutledge et al.,2 ICISS is a better predictor when physiologic measures are added. It would also have been interesting to see whether HARM could have been substantially improved by adding physiologic measures, or whether they would have been superfluous given the large number of other variables present. Despite the caveats presented above, we believe that the paper by West et al. is a valuable addition to the literature on predicting adverse outcomes for trauma patients. The authors have done some creative and valuable “out of the box”
Volume 49 • Number 3
thinking in proposing a major departure from previous methodologies. Their ideas of subdividing broader categories of mechanisms of injury and explicitly using numerous injury categories in the model are particularly appealing and intriguing. We anxiously look forward to other studies that compare HARM with TRISS and ICISS/RTS, and applaud West et al. for their efforts to enhance our ability to understand the factors that influence mortality among trauma patients. Edward L. Hanan, PhD Department of Health Policy, Management, and Behavior School of Public Health State University of New York, University at Albany Albany, New York C. Gene Cayten, MD, MPH Department of Surgery New York Medical College and Our Lady of Mercy Medical Center Bronx, New York
REFERENCES 1.
2.
Hannan E, Farrell L, Gorthy S, et al. Predictors of mortality in adult patients with blunt injuries in New York State: a comparison of the Trauma and Injury Severity Score (TRISS) and the International Classification of Disease, Ninth Revision-based Injury Severity Score (ICISS). J Trauma. 1999;47:8 –14. Rutledge R, Osler T, Emery S, et al. The end of the ISS and TRISS: ICISS outperforms both ISS and TRISS as predictors of trauma patient survival, hospital charges, and hospital length of stay. J Trauma. 1998;44:41– 49.
541