ACP American College of Physicians - Internal Medicine - Doctors for Adults

Effective Clinical Practice

A primer on scores - what counts?

July/August 2000

To judge the effects of clinical interventions, researchers look for changes in certain key variables - better known as outcome measures. Some of the most familiar (and most important) outcome measures are dichotomous variables (so-called "0,1 variables") - they either happen or they don't. Examples include heart attacks, strokes and death. Others outcome measures can take on multiple values. Physiologic and laboratory measurements fall in this category (such as blood pressure, serum sodium and CD4 count), as do various functional status and symptom scales (such as the Glasgow Coma Scale to classify level of consciousness and Visual Analog Scales to classify level of pain).

Over the past two decades a new type of outcome measure has been increasingly used in clinical research: scores. A score is a composite measure, in other words it is derived from multiple individual variables. A score may be the composite of multiple dichotomous variables, multiple physiologic and laboratory measurements, multiple scales or any combination thereof. Scores are used primarily to measure multi-attribute patient function (e.g. Mini-mental Status Score is a metric for classifying the combined functions of: orientation, computational ability, and short-term memory) or to predict risk of various outcomes (e.g. the risk of heart attack, breast cancer or death).

Because they may summarize a number of different variables (each of which may be given different weight), it can be very difficult to know what a score really means. If the topic is of interest and primary outcome is a score, critical readers should seek answers to the following questions (see Table). (If you can't answer these questions, it's tough to know what counts as an important effect.)

1. What's being measured?

The first step is to try to get a handle on the construct. This can be harder than you think. Like so many things in medicine, scores often go by their acronym (and even when you know what the acronym stands for, you may not be that much closer to the construct). Consider the following examples. PCS stands for physical component summary; it is an overall measure of physical function assessed by self-report (part of the Medical Outcomes Study SF-36). APACHE II stands for acute physiologic and chronic health evaluation (second version); it is a prognostic measure for intensive care unit patients that is used to predict inpatient mortality.

2. Which end is up?

Sometimes it is hard to know whether a higher score is good thing or a bad thing. A high PCS score, for example, is good. A high APACHE II score, on the other hand, most definitely is not.

3. What's possible?

Knowing the range of possible values is the next step for getting a feel for the results. Some scores ranges from 0 to 100 (such as the PCS score). But many do not (APACHE II ranges from 0 to 71).

4. What are some benchmarks?

Then a reader needs context - some grounding on what an expected score would be for defined set of individuals. For the PCS score published norms are available.1 For example, in the general US population, the average PCS score for men over age 65 is 42. A healthy 40-year old will have an APACHE II score of 0.

5. What matters?

Finally, a reader needs help to make judgments about constitutes an important change. In other words, a reader needs a clinical correlation. A fall of 5 points in the PCS score, for example, is equivalent to developing a new chronic disease like congestive heart failure. Of course, its not perfect (the severity of congestive heart failure varies from person to person, as does its impact) but it's a lot better than nothing. A change in APACHE II from 12 to 24 is associated with an absolute increase in inpatient mortality of 30% (from approximately 10% to over 40%).

To make sense of scores, readers should try to answer the forgoing questions. Unfortunately, authors often fail to provide the needed information. In these cases, if readers want to really understand what a score means, they must do the hard work themselves.

Table - Questions to answer in order to understand a score.

Question Answer
What is being measured?  
Which end is up?  
What is possible?  
What are some benchmarks?  
What matters?  
* Finding the answers can be challenging. One excellent resource for understanding functional health scores is: McDowell I, Newell C. Measuring Health (2nd Edition). Oxford University Press, Oxford, England. 1996.

Reference

  1. SF-36 physical and mental health summary scales: A user's manual. Boston: The Health Institute, New England Medical Center, 1994.