Journal Home
Search for

Volume 89, Issue 9, Pages 1693-1700 (September 2008)


View previous. 12 of 36 View next.

Estimating Minimal Clinically Important Differences of Upper-Extremity Measures Early After Stroke

Catherine E. Lang, PT, PhDabcCorresponding Author Informationemail address, Dorothy F. Edwards, PhDbcd, Rebecca L. Birkenmeier, MS OTR/Lb, Alexander W. Dromerick, MDe

Abstract 

Lang CE, Edwards DF, Birkenmeier RL, Dromerick AW. Estimating minimal clinically important differences of upper-extremity measures early after stroke.

Objective

To estimate minimal clinically important difference (MCID) values of several upper-extremity measures early after stroke.

Design

Data in this report were collected during the Very Early Constraint-induced Therapy for Recovery of Stroke trial, an acute, single-blind randomized controlled trial of constraint-induced movement therapy. Subjects were tested at the prerandomization baseline assessment (average days poststroke, 9.5d) and the first posttreatment assessment (average days poststroke, 25.9d). At each time point, the affected upper extremity was evaluated with a battery of 6 tests. At the second assessment, subjects were also asked to provide a global rating of perceived changes in their affected upper extremity. Anchor-based MCID values were calculated separately for the affected dominant upper extremities and the affected nondominant upper extremities for each of the 6 tests.

Setting

Inpatient rehabilitation hospital.

Participants

Fifty-two people with hemiparesis poststroke.

Interventions

Not applicable.

Main Outcome Measures

Estimated MCID values for grip strength, composite upper-extremity strength, Action Research Arm Test (ARAT), Wolf Motor Function Test (WMFT), Motor Activity Log (MAL), and duration of upper-extremity use as measured with accelerometry.

Results

MCID values for grip strength were 5.0 and 6.2kg for the affected dominant and nondominant sides, respectively. MCID values for the ARAT were 12 and 17 points, for the WMFT function score were 1.0 and 1.2 points, and for the MAL quality of movement score were 1.0 and 1.1 points for the 2 sides, respectively. MCID values were indeterminate for the dominant (composite strength), the nondominant (WMFT time score), and both affected sides (duration of use) for the other measures.

Conclusions

Our data provide some of the first estimates of MCID values for upper-extremity standardized measures early after stroke. Future studies with larger sample sizes are needed to refine these estimates and to determine whether MCID values are modified by time poststroke.

Article Outline

Abstract

Methods

Participants

Protocol

Upper-Extremity Evaluation

Grip strength

Composite upper-extremity strength

Action Research Arm Test

Wolf Motor Function Test

Motor Activity Log

Duration of use

Data Analyses

Results

Distribution of Subjects' Perceived Change Ratings

Mean Change Scores for Each Rating and Estimated MCIDs

Discussion

Cautions Before Interpreting Our Estimates

Interpreting Our Estimates

Study Limitations

Conclusions

Acknowledgment

References

Copyright

MEASUREMENT IN REHABILITATION is an ongoing concern. For poststroke rehabilitation, there are now a respectable number of standardized clinical measures that assess upper-extremity deficits in people with hemiparesis.1, 2 These clinical measures assess upper-extremity deficits poststroke at the impairment level (eg, grip strength, FMA), at the activity limitation level (eg, ARAT, WMFT), at the participation restriction level (MAL), or at multiple levels simultaneously (eg, Stroke Impact Scale). Over the past few decades, the body of literature examining psychometric properties of upper-extremity measures has grown tremendously. For many measures, reliability and validity have been sufficiently established for recommended use in daily clinical practice and in research. A newer focus on the studies of clinical measurement poststroke is the concept of the MCID.1

The MCID has been defined as “the smallest difference in score in the domain of interest which patients perceive as beneficial and which would mandate, in the absence of troublesome side effects and excessive cost, a change in the patient's management.”3(p408) MCID values are therefore important in interpreting the clinical relevance of observed changes at both the individual and group levels. MCID values can be derived a number of ways, with the various methods often categorized as either distribution-based or anchor-based.4 The distribution-based methods estimate MCID values based on the statistical characteristics of change scores within a sample. A limitation of the distribution-based methods is that the derived values do not typically indicate the importance of the score. The anchor-based methods estimate MCID values by comparing change scores with an anchor, usually either a patient or a clinician rating of change. A benefit of the anchor-based method is that the estimate is based on changes that are considered important to the patients or clinicians.

It is critical to appreciate that there is no single true MCID value for a given measure. MCID values are dynamic and context-specific.5, 6 In the area of stroke rehabilitation, factors that may affect MCID values include time since stroke, magnitude of initial deficits, and patient expectations of recovery. Published estimates of MCID values are available for 2 global activity level measures, the FIM instrument and the Barthel Index,7, 8, 9 but we could find no data-based estimates of MCID values for measures that assess upper-extremity deficits in people with hemiparesis poststroke. It is therefore time to begin to develop estimates of MCID values in these measures, particularly within the first few weeks and months after stroke, when most rehabilitation services are provided and most rehabilitation monies are spent.10, 11

The purpose of this report was to estimate MCID values of several upper-extremity measures early after stroke. We chose to estimate MCID values using a patient anchor-based method instead of a distribution based method6, 12 because the anchor-based method directly reflects the point of view of the patients.3, 13 We consider the patients' view the criterion standard for judging important changes, particularly for measures that are trying to quantify constructs such as functional abilities. We performed separate analyses for affected dominant and nondominant upper extremities because estimates of MCID values could be influenced by whether the dominant versus nondominant upper extremity was affected by the stroke. A secondary purpose of this report was to see whether the magnitude of the MCID estimates was similar across scales—that is, a similar proportion of the scale. This was worth evaluating, because if estimates of MCID were similar, then clinicians might be able to use a proportional rule to estimate MCID values on other, conceptually similar measures.13

Methods 

return to Article Outline

Participants 

Fifty-two subjects with hemiparesis poststroke were enrolled in this study (table 1). Hemiparetic subjects were participating in the VECTORS study, a single-center randomized controlled trial investigating early motor recovery of the upper extremity after stroke conducted at the Washington University School of Medicine. Subjects were recruited via the Cognitive Rehabilitation Research Group Stroke Registry from the acute neurology service of Barnes Jewish Hospital and from the rehabilitation service of the Rehabilitation Institute of St Louis. In all, 1850 patients with stroke were screened to achieve the enrolled 52 subjects. Subjects were included in the trial if they had (1) an ischemic or hemorrhagic stroke within 28 days of admission to inpatient rehabilitation; (2) persistent hemiparesis as indicated by a score of 1 to 3 on the motor arm item of the NIHSS; (3) the presence of some upper-extremity voluntary activity as indicated by the ability to move proximal and/or distal joints against gravity; (4) evidence of preserved cognitive function, as indicated by scores of 0 or 1 on the consciousness items of the NIHSS and a score of greater than 19 on the Short Blessed Memory Orientation and Concentration test14; (5) the ability to follow 2-step commands, as indicated by a score of 0 on the item 1c of the NIHSS; and (6) no upper-extremity injury or conditions that limited use prior to the stroke. Subjects were excluded from the trial if they (1) could not give informed consent, (2) had clinically significant fluctuations in mental status in the 72 hours prior to enrollment, (3) had hemispatial neglect as assessed by the Star Cancellation Test, and/or (4) were not expected to survive 1 year because of other illnesses (eg, cardiac disease, malignancy). Subject characteristics are provided in table 1; data from this cohort have been reported previously.15, 16 The Human Studies Committee of the Washington University School of Medicine approved the protocol for this study. Written informed consent was obtained from all subjects prior to testing.

Table 1.

Subject Characteristics (N=52)

VariableValues
Age (y)64±14
Admission NIHSS score5.3±1.8
Premorbid Barthel Index score99.6±2.2
Premorbid Modified Rankin Index score0.3±0.6
Time to day 0 evaluation (days since stroke)9.5±4.5
Time to day 14 evaluation (days since stroke)25.9±10.6
Sex
Male21(40)
Female31(60)
Race
White22(42)
African American29(56)
Other1(2)
Stroke type
Ischemic41(79)
Hemorrhagic11(21)
Affected side
Dominant23(44)
Nondominant29(56)

NOTE. Values are mean ± SD or n (%).

Protocol 

The data reported in this study are from the first 2 VECTORS evaluations: the prerandomization baseline assessment (study day 0; average days poststroke, 9.5d) and the first posttreatment assessment (study day 14; average days poststroke, 25.9d). Consistent with other reports investigating MCIDs, data from all subjects were pooled, regardless of group assignment in the trial.3, 17, 18, 19 All evaluations were performed by trained study personnel who were blinded to group assignment. At each evaluation session, the affected (contralateral to the lesion) upper extremity was evaluated using a battery of tests, as described below. At the second evaluation session, subjects were also asked to provide a global rating of perceived changes in their affected upper extremity by comparing “how well your arm is doing” using the following 7-point Likert scale: score 1 was much better; score 2 was a little better, meaningful; score 3 was a little better, not meaningful; score 4 was about the same; score 5 was a little worse, not meaningful; score 6 was a little worse, meaningful; and score 7 was much worse.

We chose a 7-point scale19, 20 to evaluate our subjects' ratings of change instead of a larger 15-point scale3 because a portion of the sample was not literate and may have had difficulty understanding the finer gradations of the 15-point scale. The scores on this item were then used as the anchors for grouping subjects during the calculation of MCID estimates.

Upper-Extremity Evaluation 

The affected upper extremity was evaluated using a battery of tests that included measurement at the level of impairments, activity limitations, and participation restrictions. Impairment measures included in this report are grip strength and composite upper-extremity strength. Activity measures included in this report are the ARAT and the WMFT. Participation measures included in this report are the MAL and the duration of arm use measured with accelerometers.

Grip strength 

Affected hand grip strength is a common impairment reliably assessed by a variety of stroke professionals21 and has been proposed as a surrogate measure for upper-extremity outcomes.22 Here, grip strength was assessed during the WMFT (see below) via a dynamometer measurement of the maximum amount of force produced during a palmar grip.23 Subjects were seated with the upper extremity in 0° of shoulder flexion and 90° of elbow flexion. A Jamar grip dynamometera was used with the handle position set at 3 for all measurements for all subjects. At each session, 3 measurements were taken, and the average value was used in the analyses. Grip strength values were expressed in kilograms.

Composite upper-extremity strength 

Strength is one of the most often assessed impairments poststroke. Here, strength of the shoulder, elbow, and wrist flexor and extensors were measured using a hand-held dynamometerb following a standard protocol,24 except that subjects were seated during testing. Warm-up or practice trials were not performed prior to testing. Maximal voluntary isometric strength values were recorded in kilograms for each muscle group tested. Subjects unable to produce force against the dynamometer were given a score of 0kg for that particular muscle group. The strength of each muscle group was expressed as the ratio of affected side to unaffected side maximal isometric force. Ratio values from each muscle group were averaged to form a single, composite score for the affected upper extremity for each subject.25, 26

Action Research Arm Test 

The ARAT assesses activity limitations of the upper extremity.27 It includes 19 items divided into 4 subscales: grasp, grip, pinch, and gross movement. Reliability (interrater, .99; test-retest, .98) and validity of the ARAT have been well established.15, 27, 28, 29, 30, 31 Performance on the ARAT is strongly correlated to performance on the upper-extremity motor portion of the FMA and to performance on the box and block test.31, 32 Item scores on the ARAT are summed to create subtest and full-scale scores with a maximum score of 57 indicating normal performance.

Wolf Motor Function Test 

The WMFT is a 17-item measure used to assess activity limitations of the upper extremity. It is comprised of 2 strength items and 15 timed task performance items. The task performance items begin with the measurement of simple proximal movements and progress to more complex distal and whole-limb movements. The WMFT yields 2 scores: (1) a functional ability score quantifying quality of performance and (2) a timed score quantifying speed of performance in seconds. The test has published reliability and validity.33, 34, 35, 36 In the VECTORS study, the key use task was not collected because of difficulties with instrumentation, and the results reported do not include this item.

Motor Activity Log 

The MAL is a measure of self-perceived upper-extremity participation restrictions.37, 38, 39 It uses a semistructured interview to assess how much and how well patients use their affected arm for ADLs over a specified period of time. Thirty specific ADL tasks are evaluated using a 6-point amount of use scale (how much) and a 6-point quality of movement scale (how well). The tasks include activities such as buttoning a shirt, brushing teeth, and using a key. Adequate interrater reliability (>.91) and internal consistency (α>.81) have been reported.39 The quality of movement scale, or how well the arm functioned in the ADL tasks, was collected at both study day 0 and day 14 assessments and is therefore included in this report. The amount of use scale, or how much the arm was used, was not administered at study day 0, making it impossible to calculate change scores and include the results in this report.

Duration of use 

The duration of use in a 24-hour period is an upper-extremity–specific measure of participation. We have previously reported on duration of upper-extremity use in this cohort.40 Briefly, duration of use was captured using uniaxial accelerometers.c Accelerometers were placed on the distal arm just above the wrist with the axis parallel to the length of the arm. Subjects wore the accelerometers at all times during a 24-hour data collection period except when the devices would be exposed to water (eg, personal hygiene). Data were collected in 2-second epochs over the 24 hours. Using an established methodology that provides a valid (r range, .93–.99) and reliable (test-retest, .90) measure of the duration of upper-extremity use,41, 42 data from each 2-second epoch were used to classify the upper extremity as either moving or not moving during that 2-second period. The sum of the epochs when the upper extremity moved then represented the duration of upper-extremity movement over the 24-hour period. For ease of communication, this summed variable was converted from seconds to hours. It should be noted that accelerometer measures of upper-extremity use have some limitations because they provide information about whether the upper extremity was moving, but not information about what it was moving to do (eg, functional task versus arm swing during gait). Thus, the duration of use obtained via accelerometry may be considered small overestimations of the time spent using the upper extremity for functional activities.40

Data Analyses 

SPSSd for Windows was used for all statistical analyses. Individual change scores were calculated for each measure by subtracting scores at study day 0 from scores at study day 14. Change scores for each measure were normally distributed as evaluated by Kolmogorov-Smirnov tests.

The magnitude of change that is considered meaningful may be influenced by whether the dominant versus nondominant upper extremity was affected by the stroke. Data from affected dominant upper extremities were therefore analyzed separately from data from affected nondominant upper extremities. To estimate the MCIDs on each measure, mean change scores for each perceived change rating were computed. Consistent with other recent reports using anchor-based methods in this population,8, 9 the mean change score for the smallest meaningful change (here a score of 2 is “a little better, meaningful”) was taken as the MCID. In addition to expressing the MCID as a change in raw scores on each measure, we also calculated the MCID as a percentage of the total scale (when possible) and as a single-population effect size,43 where the change score is divided by the SD of the study day 0 score. Expressing the MCIDs in these alternative formats allows for comparison of MCIDs across the measures evaluated.

Results 

return to Article Outline

Fifty-two subjects with hemiparesis poststroke were included in this report (see table 1). Subjects were an average of 64±14 years old and had been largely independent prior to the stroke. The first evaluations occurred an average of 9.5 days poststroke (study day 0), and the second evaluations occurred an average of 25.9 days poststroke (study day 14). The sample had a greater proportion of women than men and was 58% nonwhite. Seventy-nine percent of subjects had ischemic strokes, and 56% of subjects were affected on their nondominant side.

Scores on each upper-extremity measure from the study day 0 and day 14 time points are provided in table 2. We considered our sample to be moderately affected at the day 0 time point and mildly to moderately affected at the day 14 time point, although there was a fair amount of variation in performance that can be seen in the SDs of each measure. As expected in an inpatient rehabilitation setting early after stroke, the sample improved on all measures from study day 0 to study day 14. This can be seen in the mean change scores provided in the last column of table 2. Note that improvement on the WMFT time score is indicated by a negative number—that is, shorter time to complete the test items. For all other measures, improvement is indicated by a positive number.

Table 2.

Means and Change Scores at the Study Day 0 and Day 14 Time Points

Mean ± SDChange ± SD
Impairment measures
Grip strength (kg)
D09.6±10.56.9±7.3
D1416.8±12.5
Composite strength (ratio of unaffected side)
D00.34±0.280.22±0.20
D140.56±0.33
Activity measures
ARAT
D022.5±15.315.1±11.4
D1438.1±16.6
WMFT time (s)
D042.5±39.8−22.6±28.8
D1418.4±28.4
WMFT function
D02.4±1.11.2±0.8
D143.6±1.1
Participation measures
MAL how well
D00.5±0.51.2±0.9
D141.7±1.1
Duration of use (h)
D03.3±1.81.2±1.4
D144.4±2.1

NOTE. Improvement on the WMFT time score is indicated by a negative number—that is, shorter time to complete test items. For all other measures, improvement is indicated by a positive number.

Abbreviations: D0, study day 0 (average days poststroke, 9.5d); D14, study day 14 (average days poststroke, 25.9d).

Distribution of Subjects' Perceived Change Ratings 

Overall, the distribution of perceived change ratings was positively skewed. Twenty-nine subjects (56%) rated their upper extremity a 1 (“much better”), 12 subjects (23%) rated their upper extremity a 2 (“a little better, meaningful”), 4 subjects (8%) rated their upper extremity a 3 (“a little better, not meaningful”), and 7 subjects (13%) rated their upper extremity a 4 (“about the same”). No subject rated their upper extremity as having become worse (scores of 5–7). Figure 1 shows the distribution of perceived change ratings based on whether the dominant or the nondominant upper extremity was affected by the stroke. Dominant and nondominant affected sides were equally represented in the groups of subjects rating their upper extremities as a 1 or 2. More subjects who rated themselves a 3 had their dominant arm affected, and all subjects who rated themselves a 4 had their nondominant arm affected.


View full-size image.

Fig 1. Frequency of subjects' perceived change ratings. Subjects are grouped by whether the dominant or the nondominant upper extremity was affected by the stroke. Perceived change ratings: 1, much better; 2, a little better, meaningful; 3, a little better, not meaningful; 4, about the same; 5, a little worse, not meaningful; 6, a little worse, meaningful; 7, much worse.


Mean Change Scores for Each Rating and Estimated MCIDs 

Fig 2, Fig 3, Figure 4 show mean change scores for each measure plotted as a function of perceived change ratings and affected side. Subjects who rated their affected upper extremities as a 3 or 4 were grouped together in each graph because both ratings represent less than a meaningful change. For the impairment level measures, grip strength change scores (see fig 2A) had a generally linear relationship with the perceived change rating for both the dominant and nondominant groups. Composite upper-extremity strength change scores (see fig 2B) for the nondominant group (open symbols) had a linear relationship with the perceived change ratings. This was not the case, however, for the group affected on the dominant side (filled symbols). Subjects who rated their affected dominant upper extremity as having not changed or not meaningfully changed (score of 3 or 4) had somewhat higher change scores than those who rated their affected dominant upper extremity as having a small, meaningful change (score of 2).


View full-size image.

Fig 2. Mean change scores in impairment level measures by perceived change ratings and upper extremity affected. (A) Grip strength in the affected hand, measured in kilograms as part of the WMFT. (B) Composite upper-extremity strength on the affected side; values are expressed as ratios of the unaffected upper extremity. Error bars represent SEs. Perceived change ratings: 1, much better; 2, a little better, meaningful; 3, a little better, not meaningful; 4, about the same.



View full-size image.

Fig 3. Mean change scores in activity level measures by perceived change ratings and upper extremity affected. (A) ARAT; (B) WMFT time score; and (C) WMFT function score. Error bars represent SEs. Perceived change ratings: 1, much better; 2, a little better, meaningful; 3, a little better, not meaningful; 4, about the same.



View full-size image.

Figure 4. Mean change scores in participation level measures by perceived change ratings and upper extremity affected. (A) MAL quality of movement score. (B) Duration of affected upper-extremity use, measured by wrist accelerometers and expressed in hours. Error bars represent SEs. Perceived change ratings: 1, much better; 2, a little better, meaningful; 3, a little better, not meaningful; 4, about the same.


For the activity level measures, ARAT scores (see fig 3A) had a linear relationship with the perceived change rating for both the dominant and nondominant groups. The WMFT time change scores (see fig 3B) had a linear relationship when the dominant side was affected but had little relationship with perceived change ratings when the nondominant side was affected. For this measure, subjects who rated their nondominant affected upper extremity as being much better (score of 1) had change scores that were similar to change scores of the subjects who rated their nondominant affected upper extremity as changing less (scores 2, 3, and 4). The WMFT function change scores (see fig 3C) had a better relationship with the perceived change ratings than the WMFT time scores.

For the participation level measures, the MAL how well change scores (see fig 4A) had a linear relationship with perceived change ratings, with the dominant and nondominant groups very similar. The duration of upper-extremity use measured with accelerometers (see fig 4B) had no clear relationships with perceived ratings of change for either the dominant or the nondominant groups. Regardless of which side was affected, the largest changes in use were found in those people who considered their affected upper extremity as having not meaningfully changed or not changed at all (score of 3 or 4).

We used the mean change score for the smallest meaningful change (here a score of 2 is “a little better, meaningful”) as the estimate of the MCID in our sample of people with acute hemiparesis. Table 3 shows estimated values of MCID for each measure. For some measures, the mean change score for subjects who rated themselves a 2 was smaller than the mean change score for those who rated themselves a 3 or 4 (see figs 2B, 3B, 4B). The MCID on these measures was classified as unknown. When expressed as a percentage of the total scale, estimated MCID values ranged from 16% to 21% in the group that was affected on the dominant side, and 18% to 30% in the group that was affected on the nondominant side. When expressed as an effect size, estimated MCID values ranged from 0.48 to 2.00 and from 0.59 to 2.20 in the dominant and nondominant groups, respectively.

Table 3.

Estimates of MCID for Each Measure Based on Whether the Dominant or the Nondominant Upper Extremity Is Affected

MeasuresMCID if Dominant Side AffectedMCID if Nondominant Side Affected
Raw ValuePercentage of ScaleEffect SizeRaw ValuePercentage of ScaleEffect Size
Impairment measures
Grip strength (kg)5.0NV0.486.2NV0.59
Composite strength (ratio of unaffected side)NVNV0.2220.79
Activity measures
ARAT12210.7817301.10
WMFT time (s)−19160.48NVNV
WMFT function1.0170.911.2201.00
Participation measures
MAL quality of movement score1.0172.001.1182.20
Duration of use (h)NVNVNVNV

NOTE. Values are raw scores, percentages of the total scale (where appropriate), and effect sizes.

Abbreviation: NV, no value.

We were unable to estimate MCID values when there was no apparent relationship between the perceived change rating and the mean change on a given measure (see figs 2B, 3B, 4B).

To calculate percentage, total scale was assumed to range from 0.0 to 1.0.

To calculate percentage, total scale was assumed to range from 0 to 120 seconds.

Discussion 

return to Article Outline

Our purpose was to begin to examine how much change constitutes a clinically meaningful difference on upper-extremity assessments early after stroke. Using subjects' global ratings of upper-extremity change, we attempted to estimate MCID values for 2 impairment level measures, 3 activity level measures, and 2 participation level measures during the first month poststroke (see table 3). Estimates of MCID ranged from 16% to 30% of the total scales. For a few measures, we were unable to estimate MCID values because there appeared to be little relationship between change scores and perceived change ratings. Our data from this sample of people with acute hemiparesis poststroke provide some of the first estimates of MCID values for upper-extremity standardized measures. As such, they are an important contribution to the rehabilitation literature because clinicians need to evaluate meaningful change in individual patients and researchers need to evaluate meaningful change across patient groups.

Cautions Before Interpreting Our Estimates 

Our estimates are from a sample of people with acute hemiparesis poststroke, studied during their inpatient rehabilitation stay. It is not known how these estimates might generalize to people with more chronic hemiparesis or to people in other clinical settings. In the first month poststroke, substantial improvements in neuromotor capabilities and in function often occur,44, 45 particularly for those people selected to go to inpatient rehabilitation facilities in the United States. It may be that people with acute hemiparesis require a greater change in movement capabilities to consider the change clinically meaningful because (1) a large portion of the recovery happens during this period and (2) patients have strong expectations for recovery early after their stroke. Thus, our MCID values might be higher than MCID values at later time points poststroke, when people with hemiparesis may have a greater awareness of how smaller changes may be functionally beneficial and may have lower expectations for full recovery.

Most people (56%) in our sample considered themselves to be “much better” at the second evaluation. This created a skewed distribution of perceived change scores (see fig 1), with a smaller portion of people who considered themselves to have changed a little or not at all. Skewed distributions of perceived change scores are common in studies investigating MCID values.3, 13, 20, 46, 47 Given the substantial recovery that occurs in this early period poststroke, it is nearly impossible to obtain a nonskewed distribution of perceived change scores.8, 9 We consider that the primary effect of the skewed distribution was that this left us with a smaller number of people (n=12) from which to estimate the MCID values.

The number of subjects used here to calculate MCID values is similar to numbers in other anchor-based reports in the rehabilitation literature.9, 46, 47 Because of the small sample, the variability in change scores was large (see SE bars in Fig 2, Fig 3, Figure 4). Distributions of change scores in our dataset overlap for different perceived change ratings. This means that we had subjects with change scores greater than the calculated MCID values who considered themselves “about the same,” and we had subjects with change scores less than the calculated MCID values who considered themselves “much better.” Our MCID values should be interpreted with caution, particularly when making judgments about individual patients.

Interpreting Our Estimates 

Based on our data, it appears that change scores of 16% to 30% on these upper-extremity measures may be needed for a person to consider an improvement to be meaningful. In comparison, change scores of 5% to 10% have been found to be clinically meaningful on a number of health-related QOL measures in a variety of patient populations.3, 18, 19, 47 It is possible that our estimates are proportionally higher because health-related QOL measures are designed to assess changes that are inherently meaningful to the patient, whereas upper-extremity measures are designed to assess changes in upper-extremity motor capacity, some aspects of which may be less inherently meaningful to the patient. Thus, a greater proportional change may need to occur before the change is considered meaningful on focal upper-extremity measures versus more global QOL measures.

There are only a small number of MCID values available for comparison in the neurorehabilitation literature. We have been unable to find any estimates of MCID values for impairment level measures such as grip strength. For the activity level measures, a change of 6 points on the ARAT was selected as clinically meaningful in a population of people with chronic hemiparesis because it represented approximately 10% of a 57-point scale.48 Our data suggest that in an acute hemiparetic population, the patient-perceived MCID is 12 points for the dominant side and 17 for the nondominant side, representing 21% and 30% of the scale, respectively. On the WMFT time score, an improvement of 19 seconds on the affected dominant side (16% of the 120-second limit) was considered a meaningful change in our sample, but we were unable to obtain an estimate for the affected nondominant side. Interestingly, a recent report in people with subacute hemiparesis indicated that it was follow-up time scores on the WMFT and not change scores that predicted perceived upper-extremity recovery.38 This could imply that people may place more importance on their current level of function than on how much they have progressed when determining benefits of treatments. For the participation level measures, a change of 0.5 points (10%) on the MAL was selected as clinically meaningful in a population of people with chronic hemiparesis, again because it represented approximately 10% of the scale.48 Our data suggest that in an acute hemiparetic population, the patient-perceived MCID on the MAL is at least 1 point (17%−18% of the scale). Interestingly, our MCID values were proportionally similar to MCID values on 2 global activity measures in the stroke population, where the MCID of the Barthel Index is 1.85 on the 20-point scale (19%; acute population),9 and the MCID of the motor portion of the FIM instrument is 17 on the 105-point scale (16%; subacute population).8

We determined MCID values separately for the affected dominant and nondominant sides. Our estimated MCID values were a little smaller when the dominant side was affected than when the nondominant side was affected (see table 3). Although people typically use the dominant and nondominant upper extremities a similar amount during daily life,40 the 2 sides are used somewhat differently. The dominant upper extremity is used for more skilled manipulation (eg, holding and writing with a pen), whereas the nondominant upper extremity is used in a more supportive role (eg, holding the paper still during writing). A limited ability to use the dominant side after stroke may be more burdensome than a limited ability to use the nondominant side. It then follows that a smaller change in the ability to use the affected dominant side poststroke may be perceived more quickly and considered more meaningful than a similar size change in the ability to use the affected nondominant side. Because the distributions of our dominant and nondominant affected estimates overlap (see size of SE bars in Fig 2, Fig 3, Figure 4), our data can only provide preliminary support for this idea.

Study Limitations 

We were unable to estimate MCID values in instances in which there was no apparent relationship between the perceived change score and the amount of change. This was the case for composite upper-extremity strength when the dominant side was affected, for the WMFT time score when the nondominant side was affected, and for the duration of use when either side was affected (see Fig 2, Fig 3, Figure 4, table 3). One explanation for our inability to detect a relationship between the perceived change score and the amount of change may be our small sample size. Thus, the lack of relationship may be a result of the large variability in our sample. This variability may be masking a real relationship that could be uncovered with larger sample sizes, from which it may then be possible to obtain estimates of MCID values. We consider this a real possibility for composite upper-extremity strength measure and for the WMFT time score, where a relationship existed when one side was affected but not when the other side was affected. An alternative explanation for our inability to detect a relationship may be that change scores on these measures are not related to patient-perceived change. For example, it may have little meaning to the patient that their duration of upper-extremity use improves by 1.5 hours. If this alternative explanation is true, then change scores on these particular measures may be less useful for assessing outcomes in individual patients or in research studies.

Conclusions 

return to Article Outline

Our data provide some of the first estimates of what constitutes a clinically meaningful change on standardized upper-extremity measures early after stroke. Proportional changes of 16% to 30% were considered important to the people in our study. Care should be taken when attempting to generalize these results to people with chronic stroke or to people in other rehabilitation settings. Future studies with larger sample sizes are needed to refine our estimates and to determine how MCID values are affected by time since stroke, by initial stroke severity, and/or by clinical setting.

Suppliers

Acknowledgments 

return to Article Outline

We thank Joanne Wagner, PT, PhD, ATC, and Lily Hu, MS, for their assistance with data collection and the therapists who assisted with recruitment and scheduling during this project.

References 

return to Article Outline

1. 1Barak S, Duncan PW. Issues in selecting outcome measures to assess functional recovery after stroke. NeuroRx. 2006;3:505–524. Abstract | Full Text | Full-Text PDF (390 KB) | CrossRef

2. 2Finch E, Brooks D, Stratford PW, Mayo NE. Physical rehabilitation outcome measures. Hamilton: BC Decker; 2002;.

3. 3Jaeschke R, Singer J, Guyatt GH. Measurement of health status: ascertaining the minimal clinically important difference. Control Clin Trials. 1989;10:407–415. MEDLINE | CrossRef

4. 4de Vet HC, Terwee CB, Ostelo RW, Beckerman H, Knol DL, Bouter LM. Minimal changes in health status questionnaires: distinction between minimally detectable change and minimally important change. Health Qual Life Outcomes. 2006;4:54. MEDLINE | CrossRef

5. 5Beaton DE, Boers M, Wells GA. Many faces of the minimal clinically important difference (MCID): a literature review and directions for future research. Curr Opin Rheumatol. 2002;14:109–114. MEDLINE | CrossRef

6. 6Hays RD, Woolley JM. The concept of clinically meaningful difference in health-related quality-of-life research: how meaningful is it?. Pharmacoeconomics. 2000;18:419–423. MEDLINE | CrossRef

7. 7Wallace D, Duncan PW, Lai SM. Comparison of the responsiveness of the Barthel Index and the motor component of the Functional Independence Measure in stroke: the impact of using different methods for measuring responsiveness. J Clin Epidemiol. 2002;55:922–928. Abstract | Full Text | Full-Text PDF (81 KB) | CrossRef

8. 8Beninato M, Gill-Body KM, Salles S, Stark PC, Black-Schaffer RM, Stein J. Determination of the minimal clinically important difference in the FIM instrument in patients with stroke. Arch Phys Med Rehabil. 2006;87:32–39. Abstract | Full Text | Full-Text PDF (121 KB) | CrossRef

9. 9Hsieh YW, Wang CH, Wu SC, Chen PC, Sheu CF, Hsieh CL. Establishing the minimal clinically important difference of the Barthel Index in stroke patients. Neurorehabil Neural Repair. 2007;21:233–238. MEDLINE

10. 10Caro JJ, Huybrechts KF, Duchesne I. Management patterns and costs of acute ischemic stroke: an international study. For the stroke Economic Analysis Group Stroke. 2000;31:582–590. MEDLINE

11. 11Spieler JF, Lanoe JL, Amarenco P. Costs of stroke care according to handicap levels and stroke subtypes. Cerebrovasc Dis. 2004;17:134–142. MEDLINE | CrossRef

12. 12Wyrwich KW, Wolinsky FD. Identifying meaningful intra-individual change standards for health-related quality of life measures. J Eval Clin Pract. 2000;6:39–49. MEDLINE | CrossRef

13. 13Ringash J, O'Sullivan B, Bezjak A, Redelmeier DA. Interpreting clinically significant changes in patient-reported outcomes. Cancer. 2007;110:196–202.

14. 14Katzman R, Brown T, Fuld P, Peck A, Schechter R, Schimmel H. Validation of a short orientation-memory-concentration test of cognitive impairment. Am J Psychiatry. 1983;140:734–739.

15. 15Lang CE, Wagner JM, Dromerick AW, Edwards DF. Measurement of upper-extremity function early after stroke: properties of the action research arm test. Arch Phys Med Rehabil. 2006;87:1605–1610. Abstract | Full Text | Full-Text PDF (229 KB) | CrossRef

16. 16Dromerick AW, Lang CE, Powers WJ, et al. Very Early Constraint Therapy for Recovery from Stroke (VECTORS): phase II trial results [abstract]. Stroke. 2007;38:465.

17. 17Coeytaux RR, Kaufman JS, Chao R, Mann JD, Devellis RF. Four methods of estimating the minimal important difference score were compared to establish a clinically significant change in headache impact test. J Clin Epidemiol. 2006;59:374–380. Abstract | Full Text | Full-Text PDF (137 KB) | CrossRef

18. 18Wyrwich KW, Nienaber NA, Tierney WM, Wolinsky FD. Linking clinical relevance and statistical significance in evaluating intra-individual changes in health-related quality of life. Med Care. 1999;37:469–478. MEDLINE | CrossRef

19. 19Osoba D, Rodrigues G, Myles J, Zee B, Pater J. Interpreting the significance of changes in health-related quality-of-life scores. J Clin Oncol. 1998;16:139–144.

20. 20Lauridsen HH, Hartvigsen J, Manniche C, Korsholm L, Grunnet-Nilsson N. Responsiveness and minimal clinically important difference for pain and disability instruments in low back pain patients. BMC Musculoskelet Disord. 2006;7:82. MEDLINE | CrossRef

21. 21Bertrand AM, Mercier C, Bourbonnais D, Desrosiers J, Gravel D. Reliability of maximal static strength measurements of the arms in subjects with hemiparesis. Clin Rehabil. 2007;21:248–257. MEDLINE | CrossRef

22. 22Boissy P, Bourbonnais D, Carlotti MM, Gravel D, Arsenault BA. Maximal grip force in chronic stroke subjects and its relationship to global upper extremity function. Clin Rehabil. 1999;13:354–362. MEDLINE | CrossRef

23. 23Schmidt RT, Toews JV. Grip strength as measured by the Jamar dynamometer. Arch Phys Med Rehabil. 1970;51:321–327. MEDLINE

24. 24Andrews AW, Thomas MW, Bohannon RW. Normative values for isometric muscle force measurements obtained with hand-held dynamometers. Phys Ther. 1996;76:248–259. MEDLINE

25. 25Wagner JM, Lang CE, Sahrmann SA, Edwards DF, Dromerick AW. Sensorimotor impairments and reaching performance in subjects with poststroke hemiparesis during the first few months of recovery. Phys Ther. 2007;87:751–765. MEDLINE

26. 26Wagner JM, Lang CE, Sahrmann SA, et al. Relationships between sensorimotor impairments and reaching deficits in acute hemiparesis. Neurorehabil Neural Repair. 2006;20:406–416. MEDLINE

27. 27Lyle RC. A performance test for assessment of upper limb function in physical rehabilitation treatment and research. Int J Rehabil Res. 1981;4:483–492. MEDLINE

28. 28van der Lee JH, De Groot V, Beckerman H, Wagenaar RC, Lankhorst GJ, Bouter LM. The intra- and interrater reliability of the Action Research Arm Test: a practical test of upper extremity function in patients with stroke. Arch Phys Med Rehabil. 2001;82:14–19. Abstract | Full Text | Full-Text PDF (59 KB) | CrossRef

29. 29van der Lee JH, Beckerman H, Lankhorst GJ, Bouter LM. The responsiveness of the Action Research Arm Test and the Fugl-Meyer Assessment Scale in chronic stroke patients. J Rehabil Med. 2001;33:110–113. MEDLINE | CrossRef

30. 30Hsieh CL, Hsueh IP, Chiang FM, Lin PH. Inter-rater reliability and validity of the Action Research Arm Test in stroke patients. Age Ageing. 1998;27:107–113. MEDLINE | CrossRef

31. 31De Weerdt W, Harrison MA. Measuring recovery of arm-hand function in stroke patients: a comparison of the Brunnstrom-Fugl-Meyer test and the Action Research Arm Test. Physiother Can. 1985;37:65–70. CrossRef

32. 32Platz T, Pinkowski C, van Wijck F, Kim IH, di Bella P, Johnson G. Reliability and validity of arm function assessment with standardized guidelines for the Fugl-Meyer Test, Action Research Arm Test and Box and Block Test: a multicentre study. Clin Rehabil. 2005;19:404–411. MEDLINE | CrossRef

33. 33Wolf SL, Catlin PA, Ellis M, Archer AL, Morgan B, Piacentino A. Assessing Wolf Motor Function Test as outcome measure for research in patients after stroke. Stroke. 2001;32:1635–1639.

34. 34Wolf SL, Lecraw DE, Barton LA, Jann BB. Forced use of hemiplegic upper extremities to reverse the effect of learned nonuse among chronic stroke and head-injured patients. Exp Neurol. 1989;104:125–132. MEDLINE | CrossRef

35. 35Taub E, Miller NE, Novack TA, et al. Technique to improve chronic motor deficit after stroke. Arch Phys Med Rehabil. 1993;74:347–354. MEDLINE

36. 36Morris DM, Uswatte G, Crago JE, Cook EW, Taub E. The reliability of the Wolf Motor Function Test for assessing upper extremity function after stroke. Arch Phys Med Rehabil. 2001;82:750–755. Abstract | Full Text | Full-Text PDF (48 KB) | CrossRef

37. 37van der Lee JH, Beckerman H, Knol DL, de Vet HC, Bouter LM. Clinimetric properties of the Motor Activity Log for the assessment of arm use in hemiparetic patients. Stroke. 2004;35:1410–1414. CrossRef

38. 38Fritz SL, George SZ, Wolf SL, Light KE. Participant perception of recovery as criterion to establish importance of improvement for constraint-induced movement therapy outcome measures: a preliminary study. Phys Ther. 2007;87:170–178. MEDLINE | CrossRef

39. 39Uswatte G, Taub E, Morris D, Vignolo M, McCulloch K. Reliability and validity of the upper-extremity Motor Activity Log-14 for measuring real-world arm use. Stroke. 2005;36:2493–2496. CrossRef

40. 40Lang CE, Wagner JM, Edwards DF, Dromerick AW. Upper extremity use in people with hemiparesis in the first few weeks after stroke. J Neurol Phys Ther. 2007;31:56–63.

41. 41Uswatte G, Miltner WH, Foo B, Varma M, Moran S, Taub E. Objective measurement of functional upper-extremity movement using accelerometer recordings transformed with a threshold filter. Stroke. 2000;31:662–667. MEDLINE

42. 42Uswatte G, Foo WL, Olmstead H, Lopez K, Holand A, Simms LB. Ambulatory monitoring of arm movement using accelerometry: an objective measure of upper-extremity rehabilitation in persons with chronic stroke. Arch Phys Med Rehabil. 2005;86:1498–1501. Abstract | Full Text | Full-Text PDF (161 KB) | CrossRef

43. 43Kazis LE, Anderson JJ, Meenan RF. Effect sizes for interpreting changes in health status. Med Care. 1989;27:S178–S189. MEDLINE | CrossRef

44. 44Duncan PW, Lai SM, Keighley J. Defining post-stroke recovery: implications for design and interpretation of drug trials. Neuropharmacology. 2000;39:835–841. MEDLINE | CrossRef

45. 45Jorgensen HS, Nakayama H, Raaschou HO, Vive-Larsen J, Stoier M, Olsen TS. Outcome and time course of recovery in stroke, part II: time course of recovery (The Copenhagen Stroke Study). Arch Phys Med Rehabil. 1995;76:406–412. Abstract | Full-Text PDF (666 KB) | CrossRef

46. 46Iyer LV, Haley SM, Watkins MP, Dumas HM. Establishing minimal clinically important differences for scores on the Pediatric Evaluation of Disability Inventory for inpatient rehabilitation. Phys Ther. 2003;83:888–898. MEDLINE

47. 47Juniper EF, Guyatt GH, Willan A, Griffith LE. Determining a minimal important change in a disease-specific quality of life questionnaire. J Clin Epidemiol. 1994;47:81–87. MEDLINE | CrossRef

48. 48van der Lee JH, Wagenaar RC, Lankhorst GJ, Vogelaar TW, Deville WL, Bouter LM. Forced use of the upper extremity in chronic stroke patients: results from a single-blind randomized clinical trial. Stroke. 1999;30:2369–2375. MEDLINE

a Program in Physical Therapy, Washington University School of Medicine, St. Louis, MO

b Program in Occupational Therapy, Washington University School of Medicine, St. Louis, MO

c Department of Neurology, Washington University School of Medicine, St. Louis, MO

d Department of Kinesiology, University of Wisconsin, Madison, WI

e National Rehabilitation Hospital, Washington, DC

Corresponding Author InformationReprint requests to Catherine E. Lang, PT, PhD, Program in Physical Therapy, Washington University School of Medicine, Campus Box 8502, 4444 Forest Park, St. Louis, MO 63108

 Supported by the National Institutes of Health (grant nos. NS41261, HD047669), and the James S. McDonnell Foundation (grant no. 21002032).

 No commercial party having a direct financial interest in the results of the research supporting this article has or will confer a benefit on the authors or on any organization with which the authors are associated.

a Sammons Preston Rolyan, 1000 Remington Blvd, Ste 210, Bolingbrook, IL 60440-5117.

b MicroFET2; Hogan Health Industries, 8020 S 1300 W, West Jordan, UT 84088.

c Model 7164-2.4 Activity Monitors; MTI Health Services, 709 Anchors St NW, Fort Walton Beach, FL 32548.

d Version 13.0; SPSS Inc, 233 S Wacker Dr, 11th Fl, Chicago, IL 60606.

PII: S0003-9993(08)00405-X

doi:10.1016/j.apmr.2008.02.022


View previous. 12 of 36 View next.