Journal Home
Search for

Volume 88, Issue 12, Pages 1614-1621 (December 2007)


View previous. 11 of 46 View next.

Development and Validation of the Balance Outcome Measure for Elder Rehabilitation

Terry Haines, PhDabCorresponding Author Informationemail address, Suzanne S. Kuys, BPhtybc, Greg Morrison, BPhtyb, Jane Clarke, BPhtyd, Paul Bew, BPhtye, Steven McPhail, BPhtyb

Abstract 

Haines T, Kuys SS, Morrison G, Clarke J, Bew P, McPhail S. Development and validation of the Balance Outcome Measure for Elder Rehabilitation.

Objective

To develop and investigate the internal consistency, criterion-related validation, and minimum clinically significant difference of a new standing balance outcome measure for Elder Rehabilitation.

Design

Three phases: (1) cross-sectional survey with expert panel, (2) multicenter prospective cohort randomly divided into development and validation datasets, and (3) prospective cohort (single site).

Setting

Geriatric and rehabilitation units across 2 states in Australia.

Participants

A total of 1769 admissions across 17 geriatric assessment and rehabilitation units.

Interventions

Not applicable.

Main Outcome Measures

The Balance Outcome Measure for Elder Rehabilitation (BOOMER) consisted of the step test, Timed Up & Go test, Functional Reach Test, and static standing with feet together and eyes closed test. Criterion-related validity was established through comparison to the Modified Elderly Mobility Scale (MEMS) and the FIM motor score.

Results

Items of the BOOMER were already used at a majority of rehabilitation facilities surveyed. The BOOMER showed high levels of internal consistency (Cronbach α>.87) and had good correlation with the FIM motor and the MEMS (ρ>.72). The minimum clinically significant change in the BOOMER was 3 points over a 17-point scale range.

Conclusions

The BOOMER is a clinically applicable measure of standing balance among older rehabilitation patients with evidence of content and construct validity.

Article Outline

Abstract

Methods

Phase 1

Phase 2

Data analysis

Phase 3

Data analysis

Results

Phase 1

Phase 2

Phase 3

Discussion

Study Limitations

Conclusions

References

Copyright

STANDING BALANCE IS considered an essential prerequisite for the successful and safe performance of many daily activities including walking.1 Standing balance impairment occurs in many older adults as a result of age-related decline and/or pathology and is well recognized as a risk factor for falls.2, 3, 4 Rehabilitation of standing balance is often a core component of geriatric rehabilitation programs and is frequently addressed by multidisciplinary rehabilitation teams. Many developed countries presently face an aging demographic for which the health care burden of conditions related to balance impairment is likely to dramatically increase in coming years.5, 6 In view of this growing problem, it is imperative that clinically effective and economically efficient standing balance rehabilitation assessment and treatment programs be identified and implemented in clinical practice.

Balance is a complex concept requiring integration of input from sensory and musculoskeletal systems.7 The balance construct includes the ability to remain still (static), to appropriately respond to a postural challenge (internally or externally derived), and to move the body over the base of support and from 1 point to another (dynamic).8 We also consider gait to be a fundamental functional expression of the ability to maintain standing balance and deem that there is overlap between the constructs of standing balance and mobility.

Many tools exist to measure standing balance, but as yet there does not seem to exist a criterion standard of balance measurement, possibly because of the complexity of this construct.9 Many balance measures show poor sensitivity and poor generalizability limiting their usefulness in the clinical setting.10, 11, 12 Measures of standing balance can be classified as single- or multi-item tests, depending on the number of different procedures contained within each test set. For example, the Functional Reach Test (FRT)13 and step test14 can be classified as single-item tests with the Clinical Test for Sensory Integration in Balance (CTSIB)15, 16 and Berg Balance Scale17 classified as multiple item tests. Single-item tests tend to be quick and easy to administer; focus on a small proportion of domains that make up the overall construct; and, consequently, do not readily provide a global measure of the overall standing balance construct. Multiple-item tests commonly encompass a broader range of domains that make up the standing balance construct and produce a global measure of the standing balance construct if the test procedure is completed. This is useful for making many clinical decisions, such as whether a patient has sufficient balance to be discharged from inpatient care or whether a patient requires balance rehabilitation therapy. In these circumstances, it is impossible to predict the range of tasks and balance challenges that may confront a patient. Hence, a “global” balance scale that samples a range of underlying domains of the standing balance construct will provide a better guide for answering these clinical questions than individual item tests. However, global, multi-item tests tend to take longer to administer and are likely to place a greater time and physical burden on the patient.3 Consequently, it is plausible for reasons of practicality that many geriatric rehabilitation providers may avoid the use of such scales in preference to conducting a small number of single-item standing balance tests.12

The issues surrounding the selection of standing balance assessments are particularly relevant to providers of geriatric rehabilitation across inpatient, outpatient, and domiciliary care settings. Many existing measures of standing balance have been developed and investigated among community-dwelling elders and those in residential aged care facilities.18, 19, 20, 21, 22 However, few have been investigated for their validity throughout all settings of elder rehabilitation. It is essential that outcome measures be valid across multiple settings because elders may present on numerous occasions in these different settings. Measures should also reflect all levels of performance. With some measures having floor and ceiling effects or a pass or fail criteria, the ability to measure performance throughout the spectrum of patient performance should be regarded as important.23 Furthermore, some authors9 have advocated that simple measures of balance should be used in clinical practice but not at the expense of potentially useful information such as a global measure of standing balance. Thus, there is a need to identify simple balance measures for use in clinical practice and to determine if they can provide even greater information when used collectively.

In this study, we sought to investigate the possibility of using previously validated balance and mobility measures to develop a new global measure of the standing balance construct. It was intended that this measure be practically applicable (time and resource efficient); be sensitive to change; possess acceptable content validity, construct validity, and internal consistency; and have a readily identifiable minimum clinically significant change criterion.

Methods 

return to Article Outline

This study was conducted in 3 phases.

Phase 1 

Phase 1 included item selection and was designed to establish content validity and establish practical applicability. To achieve this, we used a cross-sectional survey and expert panel. Participants included rehabilitation physiotherapy (PT) service providers who were members of the Queensland Rehabilitation Physiotherapists Network (QRPN). The QRPN is a coalition of PT practitioners from publicly and privately funded PT services across Queensland, Australia. The expert panel was made up of 8 senior clinical physiotherapists with 5 to 20 years of clinical experience from QRPN member sites including metropolitan and regional and public and private facilities.

Information on balance measures currently used, their ease of applicability, and clinicians’ willingness to use them was sought through the cross-sectional survey. Twenty-two QRPN member sites were provided with a list of 12 measures. For each, facilities were asked if the measure was currently used at that facility. If the measure was not currently used, facilities were asked to indicate if they would be willing to use that measure as a component of their standard care if it was a part of this state-wide dataset. An option for facilities to nominate measures other than those listed that they used and would be willing to use in the state-wide dataset was also provided.

The survey was distributed in January 2005. Results were viewed by the expert panel, which then identified 4 single-item tests for inclusion in the outcome measure to be constructed. These measures were selected on the criteria that they were already used by responding facilities or that the responding facilities indicated that they would be happy to use the measure in future (practical applicability) and that the tests selected collectively examined a diverse range of underlying domains of the overall standing balance construct (content validity). These measures would then form the nucleus of the core dataset being collected by participating sites in phase 2 of this research program.

Phase 2 

Phase 2 included item scaling and attempted to establish the instrument’s sensitivity to change, to establish its internal consistency, and to identify minimum clinically significant change criterion. To achieve this, we used a multicenter prospective cohort, with split-sample validation.

Participants included people admitted consecutively for rehabilitation PT (inpatient, outpatient, domiciliary) at a participating site across 2 states in Australia. Patients with spinal cord injury who were wheelchair bound and patients with recent amputation (and had not yet received a prosthesis) were excluded from this trial because measures of standing balance were deemed to be irrelevant to their care at this stage.

The 4 measures selected from phase 1 of this trial included the step test,14 the Timed Up & Go (TUG) test,24 the FRT,19 and timed static standing with eyes closed and feet together (a component of the CTSIB).15, 16 These measures, selected by the expert clinician workgroup consulted in this research, were considered representative of the essential domains within the standing balance construct (static, dynamic, function) and that the measurement of these domains was essential to a “global” measure of standing balance.

The clinometric properties of each of these tests has previously been examined by using older adults with a range of clinical conditions and in a range of clinical settings.13, 14, 16, 18, 19, 24, 25, 26, 27, 28 After 2 practice trials, only 1 test trial of the FRT was recorded because (1) previous research indicates that although agreement varies, the correlation between the first trial and the average of 3 trials is excellent (intraclass correlation coefficient=.90),29 and (2) the expert workgroup noted that 1 test trial of the FRT was more reflective of current clinical practice. The static timed standing with eyes closed retained its 3 trials; however, if the first trial recorded the maximum score of 30 seconds, then the subsequent 2 trials were automatically scored as 30 seconds as per the original procedure.16, 30 The step test is a measure of a dynamic single-limb stance task in which the foot is placed onto the top of a 7.5-cm block and back to the floor as many times as possible in 15 seconds.14 Participants performed the step test with each foot, and the average of 2 step test trials (1 with each foot) was recorded. The TUG test is a dynamic and functional performance measure commonly used in rehabilitation practice as a measure of overall mobility, balance, and gait disorders.24, 31 In addition, the TUG samples the ability of an individual to turn 180° while maintaining the upright position and the ability to maintain the upright standing position immediately after transition from a seated posture.

Patient demographic data were collected along with measures of cognition (Mini-Mental State Examination32) and function (FIM instrument motor score33 or Modified Barthel Index [MBI]34) if these were collected as a part of routine care at each site. MBI scores were converted to FIM motor scores following a conversion system previously developed for the Australian setting.35 Physiotherapists also recorded if improving standing balance was the first-, second-, third-, fourth-, or lower-ranked goal on their prioritized treatment plan for that patient. Standard PT practice is to develop a prioritized problem list so that the problems of greatest importance (that can be addressed by a physiotherapist) can be focused on in treatment sessions. From this, prioritized treatment goals are formed.

The project investigative team contacted all member facilities of the QRPN along with other facilities with which professional contacts were held. Sites collected data for between 1 and 12 months from May 2005 to May 2006. Each site that participated used the 4 balance measures collected as a part of their routine admission and discharge assessments. Each site was asked to collate data from their routine assessments of consecutive patients meeting the inclusion and exclusion criteria for 6 months. A site representative was selected at each participating site. These representatives were responsible for disseminating the study protocol manual to all physiotherapists at their sites and to ensure consistent data collection in line with the instructions set out in the study protocol manual. Site representatives were also responsible for collating data at each site before forwarding data to the project manager (TH) for entry and analysis. All data were deidentified with subject- and site-specific coding before being forwarded.

Ethics approval for collation of these data was provided from the Medical Research Ethics Committee of the University of Queensland, the Queensland Health Human Research Ethics Committee, and the hospital human research ethics committees of each of the participating sites.

Data analysis 

Each site was paired with another site on the basis of the quantity of data provided (ie, the 2 sites that contributed the greatest amount of data were paired and so on) and then randomly divided into development or validation datasets. The purpose of splitting this data sample was to determine if the clinometric properties established from the development set were consistent with those of a validation dataset containing information independent of the development set. This step is important because a data-driven approach was used to construct the Balance Outcome Measure for Elder Rehabilitation (BOOMER) scaling, and previous research indicates that some clinometric properties can be biased if the validation dataset is not independent of the data from which the instrument was developed.36, 37

The primary rationale for scaling of the measures was to generate a measure that would be sensitive to change across the range of patients for whom this measure would be clinically applicable. To satisfy this design criterion, we considered that 2 conditions should be met. First, that the scale should be skewed toward poorer balance scores for the patient group clinically considered at the lowest end of the standing balance spectrum (geriatric rehabilitation inpatients at admission) for whom this measure would be applicable. Second, that the scale should be skewed toward superior balance scores for the patient group clinically considered at the highest end of the standing balance spectrum (geriatric rehabilitation outpatients and domiciliary patients at discharge) for whom this measure would be applicable. By meeting these 21 conditions, the measure will be sensitive to change when moving from 1 extreme to the other. A secondary rationale was to generate cutoff points that could readily be applied in the clinical setting; hence, we rounded off to the closest integer for the step test and to the closest number divisible by 5 for the remaining 3 tests. A tertiary rationale was to avoid ceiling and floor effects of the overall scale.

Internal consistency of the BOOMER (the extent to which individual items are correlated and are thus likely to be measuring the same construct) was assessed by using the Cronbach α.38 Interitem correlations for scaled items were calculated by using the Spearman ρ. Factor analysis (principal-factor approach) was conducted separately for admission and discharge data for development and validation datasets by using the 4 test items.

Many perspectives may be taken when assessing a minimum clinically significant change in a clinical characteristic including patient, clinician, societal, and institutional perspectives. In this investigation, the minimum clinically significant difference was evaluated by using 5 “anchor-based” approaches. Anchor-based approaches base the minimum clinically significant change on external criteria (anchors), which can be other measures or phenomena that have clinical relevance.39 The 5 approaches were (1) comparing BOOMER scores between admission and discharge assessments for inpatients discharged to the community (not residential aged care facility or other), (2) comparing admission BOOMER scores between those for whom improving standing balance was rated as a high priority goal as opposed to a low priority goal of treatment (as rated by treating physiotherapist), (3) comparing admission BOOMER scores between inpatients and outpatients (including domiciliary), (4) comparing discharge BOOMER scores between those whose residence on discharge was living in the community versus residential aged care or other destination (eg, acute hospital); and (5) comparing change in BOOMER scores (between admission and discharge) between those whose motor functional independence measure scores improved by 10 points or more and those that did not.

Each of these anchors are important from a range of perspectives and are either generally reflective of criteria delineating successful versus unsuccessful rehabilitation outcomes (eg, being discharged to the community as opposed to residential aged care or other location can be seen as being a more successful outcome) or factors that impact on clinical management (eg, patients for whom improving standing balance is identified as a high priority goal are more likely to have a greater proportion of their treatment time allocated to balance retraining). For the second approach, all potential cutoff points for the “improving balance goal priority” variable were examined in the development dataset to determine if BOOMER scores differed significantly between groups separated by this criterion. This cutoff was then applied to the validation dataset. Point estimates and 95% confidence intervals (CIs) for each of these differences were calculated for both development and validation datasets.

Phase 3 

Phase 3 established construct (criterion) validity. To do this, we used a prospective cohort (single site) design. Patients were selected from 1 site participating in phase 2 of this research program.

Criterion reference tests were the Modified Elderly Mobility Scale (MEMS),40 a measure of mobility among older adults, and the FIM motor score,33 a measure of independence in performing functional tasks. Because standing balance is a construct closely related to mobility and function,4, 23, 41, 42 these measures were considered appropriate criterion reference tests with which to assess the validity of the BOOMER. The MEMS,40 commonly used within Queensland, Australia, is a measure of global function comprising 8 items including lying to sitting, sitting to standing, walking, functional reach, and stairs. The MEMS has shown criterion-related validity with the FIM motor score and interrater and test-retest reliability.

The MEMS and FIM motor scores were collected at this site at patient admission and discharge along with demographic details and other test items that constituted the BOOMER. This was a component of routine care at this site. Data were not examined until after formation of the BOOMER so that knowledge of the data distribution would not influence the design of the BOOMER.

Data analysis 

The correlation between the MEMS and the FIM motor with the BOOMER was examined by using Spearman ρ correlation coefficient analysis. Admission and discharge scores were analyzed separately. All statistical analyses were conducted by using Stata.43,a

Results 

return to Article Outline

Phase 1 

Responses to the phase 1 survey were received from 16 of the 22 QRPN member sites contacted. Eight tests (step test, TUG test, FRT, static standing with eyes closed and feet together, static tandem stance, single-leg stance, standing with feet together and eyes open, ten-meter walk test) were currently used by at least 11 of the 16 sites. One site responded that they would not be willing to use 2 of the tests (static tandem stance, single-leg stance) as a part of their standard practice. From this list, the expert panel selected the step test, TUG test, FRT, and static standing with eyes closed. The expert panel deemed that these tests encompassed a range of static and dynamic tasks incorporating a range of functional activities including both motor and sensory challenges to balance and that these tests adequately sampled the range of domains that constitutes the overall standing balance construct while preserving the desire for the tool to be practically applicable.

Phase 2 

Data from 1769 participants across 18 sites (16 from Queensland, 2 from Victoria) were collated. Data from 1 site did not include 1 of the 4 balance measures and were not allocated to either the development or validation dataset. Patient demographics of these datasets are presented along with the demographics of patients contributing data to phase 3 (table 1).

Table 1.

Baseline Demographics of Participants in Development and Validation Datasets and for the Single Site in Phase 3 of the Study

DemographicDevelopmentValidationPhase 3
N683784272
Age (y)74±1474±1375±14
Men (n)271(39.6)315(40.2)105(38.6)
Diagnosis (n)
Stroke155(24)176(23)66(24)
Other neurologic condition88(13)52(7)37(14)
Elective orthopedic34(5)72(9)2(1)
Other orthopedic203(30)200(26)82(30)
Other set diagnoses combined125(18)188(24)53(19)
Other geriatric impairment§67(10)94(12)32(12)
Aid before admission nothing or single point stick527(77)550(70)194(71)
Living arrangements before admission community dwelling651(96)721(92)255(94)
Mini-Mental State Examination (/30)25±524±624±5
Admission FIM motor (/91)59±1754±1758±17
Discharge FIM motor (/91)75±1573±1675±14
Length of stay (d)40±3232±2350±32

NOTE. Values are mean±standard deviation or frequency (%).

Refers to neurologic conditions other than stroke (eg, Parkinson’s disease).

Refers to orthopedic diagnoses that were not elective, for example, fractured neck of femur.

A merged category including the categories of amputation, spinal cord injury, arthritis, pain, musculoskeletal, major multiple trauma, cardiac, pulmonary, and burns.

§

Category for conditions associated with older age not adequately described by the above categories (eg, deconditioning and poor mobility).

To scale each item, 4 cutoff points were selected creating 5 ordinal categories (scores range, 0−4). An overall score was created by summing the scores for each item (scores range, 0−16). This was initially conducted by using the admission dataset for inpatients in the development dataset on the basis of generating a skewed data distribution toward poorer scores. This was then assessed graphically by using frequency histograms. The cutoff points were applied to the discharge scores of the outpatient and domiciliary patients of the development dataset combined to examine whether a skew toward higher scores had been attained. This was again checked graphically, and modifications to the selected cutoff points were made when deviations from the desired properties were observed (for both individual items and the overall scale) before repeating the process. Multiple iterations of this process were pursued before settling on the final set of cutoff points (table 2), which the authors determined best satisfied the 3 rationale for item scaling.

Table 2.

Cutoff Points for BOOMER Items

Test01234
Step test (average no. of steps)Unable>0–5>5–8>8–12>12
TUG test (s)Unable≥30<30–20<20–10<10
FRT (cm)0>0–15>15–20>20–30>30
Static standing eyes closed (s)Unable>0–30>30–60>60–<9090

The distribution of each item for inpatient admission and outpatient and domiciliary discharge scores arising from the development and validation datasets are presented (fig 1). These frequency histograms indicated that the primary rationale for item scaling had been met as skew toward poorer scores for admission inpatient data, and skew toward superior balance scores for discharge outpatient and domiciliary data were evident in both individual items and the overall BOOMER scale (fig 2). One possible exception (the static standing with eyes closed and feet together item) was evident because it showed a bimodal distribution for admission inpatient scores. A mild floor effect was also apparent when examining the admission inpatient datasets. Comparison between development and validation datasets indicated that the data distributions were consistent.


View full-size image.

Fig 1. Distribution of scores for each item from inpatient admission and outpatient and domiciliary discharge data.



View full-size image.

Fig 2. Distribution of BOOMER scores from inpatient admission and outpatient and domiciliary discharge data.


The internal consistency of the items (α) ranged from .87 to .89 for the admission and discharge assessments of the development and validation datasets. Interitem correlations (r) ranged from .77 to .51, with the lowest interitem correlations consistently being found in those involving the static standing with eyes closed and feet together test (table 3). Factor analysis resulted in single-factor solutions for admission and discharge data of development and validation datasets (analyzed separately). Uniqueness (1 – communality) varied for each of the following items: step test (.29−.35), TUG test (.27−.35), FRT (.32–.40), and static stance eyes closed (.48−.55).

Table 3.

Interitem Correlation (Spearman ρ) for BOOMER Items at Admission and Discharge Across Development and Validation Datasets

Step TestTUG TestFRT
DevelopmentAdmissionTUG test.71
FRT.72.73
Static standing eyes closed.65.64.68
DischargeTUG test.77
FRT.66.74
Static standing eyes closed.52.52.53
ValidationAdmissionTUG test.70
FRT.67.65
Static standing eyes closed.59.61.66
DischargeTUG test.77
FRT.65.69
Static standing eyes closed.56.51.52

Point estimates and 95% CIs for each of the 5 approaches to calculating the minimum clinically significant difference are presented (fig 3). For the second approach (comparing those for whom standing balance was rated as a higher- or a lower-ranked goal), the cutoff point of “fourth or lower-ranked problem” was identified and used. The differences ranged between 2 and 6 points on the BOOMER scale, with 95% CIs overlapping between 3 to 4 points. The minimum clinically significant differences identified by using each approach were consistent between development and validation datasets.


View full-size image.

Fig 3. Point estimates and 95% CIs for minimum clinically significant differences for the BOOMER calculated from development and validation datasets. Data presented are point estimate and 95% CI of the mean difference between groups. *Discharge BOOMER – admission BOOMER (within subjects), for inpatients discharged to the community. Admission BOOMER for subjects for whom improving standing balance was a fourth- or lower-ranked treatment goal – admission BOOMER for subjects who had improving standing balance as 1 of their top 3 treatment goals (between subjects). Admission BOOMER for outpatient or domiciliary patients – admission BOOMER for inpatients (between subjects). §Discharge BOOMER for patients whose living status at discharge was the community – discharge BOOMER for patients whose living status at discharge was not the community (eg, nursing home, hospital) (between subjects). Change in BOOMER (discharge – admission) for people whose improvement in FIM motor score was 10 points or more – change in BOOMER (discharge – admission) for people whose improvement in FIM motor score was less than 10 points (between subjects).


Phase 3 

The BOOMER showed good correlation with both the motor FIM (admission data, ρ=.73; discharge data, ρ=.72) and the MEMS (admission data, ρ=.88; discharge data, ρ=.83). Each of these associations was statistically significant (P<.001).

Discussion 

return to Article Outline

The BOOMER is a new measure of standing balance impairment designed specifically for practical implementation in elder rehabilitation health care settings. We contend that the BOOMER balances the competing needs of a tool to adequately sample and scale a range of items to ensure the construct validity of the instrument while also being practically applicable for both clinical and research settings.

Establishing the validity of a measurement tool can be viewed as an ongoing process requiring many studies across a range of patient groups and clinical settings. Through this first investigation, we are confident in many of the BOOMER’s properties. The item identification and selection process provides evidence that the BOOMER has content validity and is likely to be practically applicable. This is partly based on an assumption that physiotherapists will not use measures they do not believe to be both valid and practically applicable in clinical practice and that only those meeting both criteria will have been selected during phase 1 of this project. The expert panel added to this selection process and encompassed expertise from senior rehabilitation physiotherapists from a range of rehabilitation facilities.

BOOMER items measure various constructs of balance including static balance, reduced vision (static standing with feet together and eyes closed), forward reach during a self-generated perturbation (FRT), dynamic single-limb stance task (step test), and rising and lowering from a chair, walking, and turning, which are tasks that potentially threaten balance (TUG test). For an instrument to be a valid measure of a construct, it is important that its constituent items be consistent with each other because divergences from this indicate that individual items may be measuring different constructs. The internal consistency of the BOOMER as identified in phase 2 was high and consistent between admission and discharge assessments for both the development and validation datasets. Appropriate ranges for the Cronbach α vary depending on the purpose of the tool. For comparing groups and formative research, α ranges between 0.7 and 0.8 are satisfactory, with higher values possibly indicating redundancy of information provided by 1 or more of the items.44, 45 Yet, for clinical applications, higher scores at the level of 0.9 have been argued as being necessary.45 Closer examination of the interitem correlation indicated that the static standing with eyes closed and feet together test was less closely related than the remainder of the tests. Clinically, this is not surprising because this item provided a primarily “sensory” challenge to standing balance, whereas the remaining tests could be characterized as providing “motor” challenges. The bimodal distribution for this item (particularly for admission inpatient data) may also have contributed to this. The removal of this item from the BOOMER would likely enhance its internal consistency beyond α equal to 0.9, yet this would be at the cost of content validity because this was the only item with a primarily sensory challenge to balance. Hence, we argue for the retention of this item while recognizing that its retention is likely to hold internal consistency marginally below preferred levels for clinical application. Factor analysis further justified item selection in that each item loaded on the same single factor, yet each provided substantial uniqueness such that we did not consider there to be excessive overlap between items.

Another mechanism for determining the construct validity of a measurement scale is to compare this scale with scales that measure similar, related constructs. In this study, we compared the BOOMER (a measure of static and dynamic standing balance) with the MEMS (a measure of mobility) and the FIM motor score (a measure of functional independence). Correlations between the BOOMER and the MEMS and FIM motor score were high across admission and discharge scores within both development and validation datasets. This high yet imperfect degree of correlation indicates that these instruments are measuring similar yet separate constructs and provides additional evidence toward the construct validity of the BOOMER.

The minimum clinically significant change of the BOOMER identified in this investigation is arguably 3 points on the 0- to 16-point BOOMER scale. This minimum clinically significant difference was based on 2 of the anchors examined. The first was important from the clinician perspective, being whether improving standing balance was 1 of the top 3 clinical problems from a PT perspective. This is of clinical importance because it will impact the content of the rehabilitation provided by the physical therapist. The second, being whether the patient exhibited a 10-point change in FIM motor score or not, is important from patient and societal perspectives because previous research46 has found that an improvement in the FIM score by 10 points reduced the time required to care for a group of community-dwelling stroke patients by half. Other criteria identified differences in excess of 3 points on the BOOMER; however, the intent of this investigation was to identify the minimum clinically significant difference. Thus, with the lower anchor point criteria recognized as being clinically relevant, it is this 3-point difference that should act as the minimum clinically significant change. Identifying the minimum clinically significant difference of a scale is important because large trials may identify statistically significant differences that are so small in magnitude they are not of clinical importance.

Study Limitations 

The analysis did raise a concern that the BOOMER may be subject to a floor effect and, thus, potentially be unable to discriminate between subjects with greatly diminished standing balance ability and be insensitive to change across the rehabilitation period. In considering this, it should first be noted that this study received data from facilities and services primarily providing inpatient rehabilitation. In Australia, geriatric inpatient rehabilitation is commonly provided to older adults who have become medically stable after a recent illness or surgery and require an extended rehabilitation period before returning to their place of residence. Consequently, intensive therapy focused on improving standing balance commonly commences after admission to the rehabilitation wards. It is not surprising then that many people do not have sufficient standing balance to register more than the minimum score on this standing balance scale at this time.

Another study limitation was that we considered only existing standing balance measures. It is plausible that as-yet-undeveloped measures of standing balance could better measure the standing balance construct. However, pursuing only existing assessment strategies, particularly those already in use in clinical practice, is likely to enhance the practical applicability and clinician acceptability of the BOOMER.

Conclusions 

return to Article Outline

Future research is required to investigate other clinometric properties of the BOOMER. Comparison to other balance scales that provide a global measure of balance, such as the Berg Balance Scale, is necessary to further understand the construct validity of the BOOMER. The resource requirements and patient burden for completing these alternative measures should also be considered in these comparisons. The sensitivity to change of the BOOMER should be investigated in comparison to other scales, preferably in a prospective study with multiple follow-up points during the rehabilitation period. Interrater reliability is likely to be high given previous work on the constituent items of the BOOMER, yet it would still be beneficial to examine this assumption, particularly because the functional reach item was modified from the original description and the static standing with eyes closed and feet together item was extracted from a larger test battery.

Supplier

References 

return to Article Outline

1. 1Lamb SE, Ferrucci L, Volapto S, Fried LP, Guralnik JM. Risk factors for falling in home-dwelling older women with stroke: the Women’s Health and Aging Study. Stroke. 2003;34:494–501. CrossRef

2. 2Oliver D, Daly F, Martin FC, McMurdo ME. Risk factors and risk assessment tools for falls in hospital in-patients: a systematic review. Age Ageing. 2004;33:122–130. MEDLINE | CrossRef

3. 3Perell K, Nelson A, Goldman R, Luther SL, Prieto-Lewis N, Rubenstein LZ. Fall risk assessment measures: an analytic review. J Gerontol A Biol Sci Med Sci. 2001;56:M761–M766. MEDLINE

4. 4Shimada H, Obuchi S, Kamide N, Shiba Y, Okamoto M, Kakurai S. Relationship with dynamic balance function during standing and walking. Am J Phys Med Rehabil. 2003;82:511–516. MEDLINE | CrossRef

5. 5Stevens JA, Corso PS, Finkelstein EA, Miller TR. The costs of fatal and non-fatal falls among older adults. Inj Prev. 2006;12:290–295. MEDLINE | CrossRef

6. 6Moller J. Changing health resource demands for injury due to falls in an ageing population. N S W Public Health Bull. 2002;13(1-2):3–6. MEDLINE | CrossRef

7. 7Konrad HR, Girardi M, Helfert R. Balance and aging. Laryngoscope. 1999;109:1454–1460. CrossRef

8. 8Woollacott MH, Tang PF. Balance control during walking in the older adult: research and its implications. Phys Ther. 1997;77:646–660. MEDLINE

9. 9Cowley A, Kerr K. A review of clinical balance tools for use with elderly populations. Crit Rev Phys Rehabil Med. 2003;15:167–205.

10. 10Tyson S, DeSouza L. Measurement of balance and walking post-stroke: Part 1. Ordinal scales. Phys Ther Rev. 2002;7:177–185.

11. 11Tyson S, DeSouza L. Measurement of balance and walking post-stroke: Part 2. Functional performance tests. Phys Ther Rev. 2002;7:87–91.

12. 12Tyson S, DeSouza L. Measurement of balance poststroke: Part 3. Instrumented measurement tools. Phys Ther Rev. 2002;7:231–241.

13. 13Duncan P, Weiner D, Chandler J, Studenski S. Functional reach: a new clinical measure of balance. J Gerontol. 1990;45:M192–M197. MEDLINE

14. 14Hill K, Bernhardt J, McGann A, Maltese D, Berkovits D. A new test of dynamic standing balance for stroke patients: reliability, validity and comparison with healthy elderly. Physiother Can. 1996;48:257–262. CrossRef

15. 15Cohen H, Blatchly CA, Gombash LL. A study of the clinical test of sensory interaction and balance. Phys Ther. 1993;73:346–351discussion 351-4. MEDLINE

16. 16Anacker S, Di Fabio R, Horak F. Influence of sensory inputs on standing balance in community-dwelling elders with a recent history of falling. Phys Ther. 1992;72:575–581. MEDLINE

17. 17Berg K, Wood-Dauphinee S, Williams J. The Balance Scale: reliability assessment with elderly residents and patients with an acute stroke. Scand J Rehabil Med. 1995;27:27–36. MEDLINE

18. 18Steffen TM, Hacker TA, Mollinger L. Age- and gender-related test performance in community-dwelling elderly people: Six-Minute Walk Test, Berg Balance Scale, Timed Up & Go Test, and gait speeds. Phys Ther. 2002;82:128–137. MEDLINE

19. 19Duncan P, Studenski S, Chandler J, Prescott B. Functional reach: predictive validity in a sample of elderly male veterans. J Gerontol. 1992;47:M93–M98. MEDLINE

20. 20Nitz JC, Choy NL, Isles RC. Medial-lateral postural stability in community-dwelling women over 40 years of age. Clin Rehabil. 2003;17:765–767. MEDLINE | CrossRef

21. 21Lord SR, Ward JA. Age-associated differences in sensori-motor function and balance in community dwelling women. Age Ageing. 1994;23:452–460. MEDLINE

22. 22Lord SR, March LM, Cameron ID, et al. Differing risk factors for falls in nursing home and intermediate-care residents who can and cannot stand unaided. J Am Geriatr Soc. 2003;51:1645–1650. MEDLINE | CrossRef

23. 23Curb JD, Ceria-Ulep CD, Rodriguez BL, et al. Performance-based measures of physical function for high-function populations. J Am Geriatr Soc. 2006;54:737–742. MEDLINE | CrossRef

24. 24Podsiadlo D, Richardson S. The timed “Up & Go”: a test of basic functional mobility for frail elderly persons. J Am Geriatr Soc. 1991;39:142–148. MEDLINE

25. 25Dite W, Temple V. A clinical test of stepping and change of direction to identify multiple falling older adults. Arch Phys Med Rehabil. 2002;83:1566–1571. Abstract | Full Text | Full-Text PDF (83 KB) | CrossRef

26. 26Weiner D, Bongiorni D, Studenski S, Duncan P, Kochersberger G. Does functional reach improve with rehabilitation?. Arch Phys Med Rehabil. 1993;74:796–800. MEDLINE | CrossRef

27. 27Weiner D, Duncan P, Chandler J, Studenski S. Functional reach: a marker of physical frailty. J Am Geriatr Soc. 1992;40:203–207. MEDLINE

28. 28Hill K, Miller K, Denisenko S, Clements T, Batchelor F. Manual for clinical outcome measurement in adult neurological physiotherapy. 2nd ed.. Melbourne: Aust Physiotherapy Association Neurology Special Group (Victoria); 2001;.

29. 29Billek-Sawhney B, Gay J. The functional reach test: are three trials necessary?. Top Geriatr Rehabil. 2005;21:144–148.

30. 30Di Fabio R, Anacker S. Identifying fallers in community living elders using a clinical test of sensory interaction for balance. Eur J Phys Med Rehabil. 1996;6:61–66.

31. 31Cho BL, Scarpace D, Alexander NB. Tests of stepping as indicators of mobility, balance, and fall risk in balance-impaired older adults. J Am Geriatr Soc. 2004;52:1168–1173. MEDLINE | CrossRef

32. 32Folstein M, Folstein S, McHugh P. ”Mini-mental state.” A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975;12:189–198. MEDLINE | CrossRef

33. 33Linacre JM, Heinemann AW, Wright BD, Granger CV, Hamilton BB. The structure and stability of the Functional Independence Measure. Arch Phys Med Rehabil. 1994;75:127–132. MEDLINE

34. 34Shah S, Vanclay F, Cooper B. Improving the sensitivity of the Barthel Index for stroke rehabilitation. J Clin Epidemiol. 1989;42:703–709. MEDLINE | CrossRef

35. 35Eagar K, Gordon R, Hodgkinson A, Eagar K, Green J, de Graaff S. The Australian National Sub-Acute and Non-Acute Patient Classification (AN-SNAP): report of the National Sub-Acute and Non-Acute Casemix Classification Study. Wollongong: Centre for Health Service Development, Univ Wollongong; 1997;.

36. 36Lijmer JG, Mol BW, Heisterkamp S, et al. Empirical evidence of design-related bias in studies of diagnostic tests. [published erratum in: JAMA 2000;283:1963] JAMA. 1999;282:1061–1066. MEDLINE | CrossRef

37. 37Haines T, Hill K, Walsh W, Osborne R. Design-related bias in hospital fall risk screening tool predictive accuracy evaluations: systematic review and meta-analysis. J Gerontol A Biol Sci Med Sci. 2007;62:664–672. MEDLINE

38. 38Cronbach L. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16:297–334. CrossRef

39. 39Crosby RD, Kolotkin RL, Williams GR. Defining clinically meaningful change in health-related quality of life. J Clin Epidemiol. 2003;56:395–407. Abstract | Full Text | Full-Text PDF (147 KB) | CrossRef

40. 40Kuys S, Brauer S. Validation and reliability of the Modified Elderly Mobility Scale. Australas J Ageing. 2006;25:140–144.

41. 41Duncan PW, Chandler J, Studenski S, Hughes M, Prescott B. How do physiological components of balance affect mobility in elderly men?. Arch Phys Med Rehabil. 1993;74:1343–1349. MEDLINE | CrossRef

42. 42Steadman J, Donaldson N, Kalra L. A randomized controlled trial of an enhanced balance training program to improve mobility and reduce falls in elderly patients. J Am Geriatr Soc. 2003;51:847–852. MEDLINE | CrossRef

43. 43StataCorp. Stata statistical software. College Station: Stata Pr; 2003;.

44. 44Nunally J. Psychometric theory. 2nd ed.. New York: McGraw-Hill; 1978;.

45. 45Bland JM, Altman DG. Cronbach’s alpha. Br Med J. 1997;314:572.

46. 46Granger CV, Cotter AC, Hamilton BB, Fiedler RC. Functional assessment scales: a study of persons after stroke. Arch Phys Med Rehabil. 1993;74:133–138. MEDLINE

a Division of Physiotherapy, School of Health and Rehabilitation Sciences, University of Queensland, Brisbane, Australia

b Physiotherapy Department, Princess Alexandra Hospital, Brisbane, Australia

c School of Physiotherapy and Exercise Science, Griffith University, Gold Coast, Australia

d St Andrews War Memorial Hospital, Brisbane, Australia

e The Prince Charles Hospital, Brisbane, Australia.

Corresponding Author InformationReprint requests to Terry Haines, PhD, Physiotherapy Department, GARU, Princess Alexandra Hospital, Ipswich Rd, Woolloongabba, QLD 4102, Australia

 Supported by the University of Queensland New Staff Research Fund.

 No commercial party having a direct financial interest in the results of the research supporting this article has or will confer a benefit upon the author(s) or upon any organization with which the author(s) is/are associated.

a Version 8.0; StataCorp, 4905 Lakeway Dr, College Station, TX 77845.

PII: S0003-9993(07)01563-8

doi:10.1016/j.apmr.2007.09.012


View previous. 11 of 46 View next.