| | Rasch Analysis of the Barthel Index in the Assessment of Hospitalized Older Patients After Admission for an Acute Medical ConditionAbstract de Morton NA, Keating JL, Davidson M. Rasch analysis of the Barthel Index in the assessment of hospitalized older patients after admission for an acute medical condition. ObjectiveTo investigate the validity of item score summation for the original and modified versions of the Barthel Index. DesignRasch analysis of Barthel Index data. SettingGeneral medical wards at 2 acute care hospitals in Australia. ParticipantsConsecutive older medical patients (N=396). InterventionsNot applicable. Main Outcome MeasuresActivity limitation was assessed by using the Barthel Index at hospital admission and discharge. At 1 hospital site, the original Barthel Index was used, and at the other hospital site the Modified Barthel Index (MBI) was used. ResultsMore than half of the items showed misfit to the Rasch model for both versions of the Barthel Index. The continence items appear to measure a different construct to the other items. After the removal of the continence items, data for the remaining items still did not fit the Rasch model. Neither the original nor the MBI are unidimensional scales. An exception to this occurred when the original Barthel Index was rescored and only then for discharge and not for admission Barthel Index data. ConclusionsBecause clinicians do not typically rescore outcomes obtained by using the Barthel Index, these findings, combined with unacceptable ceiling effects, render the Barthel Index an assessment tool with limited validity for measuring and monitoring the health of older medical patients. THE BARTHEL INDEX WAS developed by Mahoney and Barthel in 19651 to assess the nursing care requirements of people undertaking rehabilitation by measuring the performance of activities of daily living (ADLs) including grooming, bathing, feeding, getting on and off the toilet, ascending and descending stairs, dressing, bladder and bowel continence, walking, and transferring. The Barthel Index has since been reported to be the best and most widely used measure of ADL function.2 It is now applied as an outcome measure in many differing patient populations and has been recommended by the Royal College of Physicians for routine use in the assessment of older people.3 Outcome measures with adequate clinimetric properties are required for objectively monitoring individual patient change and for investigating the effects of interventions. In the acute hospital setting, Barthel Index items provide important information regarding the likely care needs of people during hospitalization and readiness for discharge information. However, there are reasons to clarify the validity of applying the Barthel Index total score as a method for measuring and monitoring changes in activity limitation during acute hospitalization. In a summary of the measurement properties of the Barthel Index, Shah4 reported the Barthel Index to be a unidimensional scale (ie, items measure 1 common underlying construct) that provides interval level data. Similarly, Hsieh and Hsueh5 reported a single-factor structure for the original version of the Barthel Index after factor analysis of data collected from patients after stroke. However, by using Rasch analysis, researchers have repeatedly concluded that when the Barthel Index is used to assess neurologic rehabilitation populations, it is not unidimensional and the data do not display interval qualities.6, 7, 8 The current literature offers 2 different approaches to assessing scale dimensionality.9 In a classical test theory approach, factor analysis or principal components analysis would identify items that cluster on 1 factor that explains the majority of the variance in the scale. Alternatively, Rasch analysis, based on item response theory, tests a set of items that are ordered in a hierarchy of difficulty from easiest to hardest for fit to a probabilistic linear scale. Item response theory asserts that scale items can only be validly summated to provide a total score if the scale is unidimensional. Rasch modeling was the preferred method in this study because it has many reported advantages compared with the classical test theory approach.9 Rasch analysis offers a method for examining if an item set forms a valid unidimensional scale. If the items satisfy the expectations of the Rasch model, scale scores can be transformed to interval level measurement. The Rasch model also places items in a hierarchy based on item difficulty, allowing ranking of items from easiest to hardest. Rasch analysis facilitates examination of the hierarchy of item difficulty and can therefore identify items of similar difficulty that may provide redundant data. The same process allows gaps to be identified if items are missing from the item hierarchy. In these areas of the scale, scores may not adequately capture transition across the spectrum of abilities, resulting in large improvements being required before change registers for some levels of ability. Properties of ordinal measurement scores are important because they influence interpretation and application of test scores. The ability to show change on an ordinal scale may vary depending on where a person is initially located on the measurement continuum.10 For example, it is common for a person on an ordinal measurement scale to have greater difficulty to achieve a change in score if they have either very poor ability or very high ability on the construct of interest (ie, they are located at the scale extremes).10 Therefore, change scores for ordinal measures need to be interpreted with caution because if nonequal intervals exist between adjacent scores, change scores for patients with different levels of ability may misrepresent the amount of change, or fail to detect change, in the underlying construct of interest. Despite the widespread use of the Barthel Index in the acute hospital setting, Rasch analysis of Barthel Index data has not been previously conducted for patients in this setting. Only measurements obtained with the Modified Barthel Index (MBI) have been subjected to Rasch analysis,6, 7, 8, 11 but this has not been performed with data collected from patients in the acute hospital setting. This study was a secondary analysis of original and MBI data that was initially obtained to conduct an individual patient data meta-analysis.12 This study aimed to (1) investigate the validity of item-score summation for the original Barthel Index and MBI for older acute medical patients and (2) investigate and compare the Rasch measurement properties of the original Barthel Index and MBI. Methods  Participants Barthel Index data for Rasch analysis were obtained from participants who were enrolled in the exercise intervention studies reported by de Morton13 and Jones14 and colleagues. Data were collected at 2 acute care public hospitals in Melbourne, Australia. Participants were eligible for inclusion if they were a general medical patient, were 65 years or older, and were assessed within 48 hours of hospital admission. Patients were excluded if they were admitted to the hospital from a nursing home, were nursing home level of care, required palliative care, had suffered a stroke, had a condition for which mobilization was contraindicated, or were readmitted during the data-collection period and had previously participated in the study. Data were collected from 236 and 160 consecutive older acute medical patients admitted to the 2 hospitals. Ethics approval to perform deidentified secondary analyses was obtained from the Monash University Ethics Committee and relevant Hospital Ethics Committees. Procedure The Barthel Index was administered by using patient self-report in response to a face-to-face interview with a physiotherapist within 48 hours of hospital admission and again within 48 hours of hospital discharge. Where there was uncertainty about the cognitive ability of the patient, the Barthel Index was administered by asking nursing staff or family. Outcome Measures The original Barthel Index,1 which is comprised of two 2-point dichotomous, six 3-point, and two 4-point response option items, was administered at 1 hospital. At the other hospital, a modified version of the Barthel Index15 was used (ten 5-point response option items). The response options for these differing versions of the Barthel Index are shown in table 1. Rasch Analysis The Rasch model is a probabilistic model that asserts that an item response is a result of an interaction between person ability (ß) and item difficulty (θ).10 This model assumes that the probability that a person of a particular ability will successfully complete an item is a logistic function of the difference between person ability (eg, level of ADL ability) and item difficulty (eg, difficulty of ADL challenge). If ß equals θ, the probability of success is 0.5. As ß (person ability) increases and exceeds θ (item difficulty), the probability of success increases. If data fit the Rasch model, the person responses are expected to be consistent with the order of item difficulty because they are measuring 1 construct,10 and this consistency is indicated by a total-item chi-square probability greater than .05. The Rasch model has 2 assumptions. The first requires that items measure a single underlying construct, thereby forming a unidimensional scale. The second assumption is that items have local independence. This requires that responses to 1 item are not dependent on the responses to another item or that “after controlling for the underlying trait, item responses are independent.”16(p147) Residuals from Rasch analysis were examined to investigate these assumptions. A finding of no association between residuals for individual items has been argued as evidence of local item independence.17 High positive correlation between residuals provides evidence of local item dependence, and high negative correlations is thought to indicate multidimensionality. Unidimensionality was formally tested by examining the principal component loadings of the residuals.17, 18 Items were identified that loaded either positively or negatively on the first residual component after Rasch analysis (correlated at >0.3), and 2 item subsets were created. Person-location estimates were obtained by using the differing item subsets, and a series of independent t tests were conducted to compare person-location estimates obtained by using the differing item subsets. The percentage of t tests outside the acceptable range of 2 standard deviations (SDs) was then calculated with an accompanying binomial proportions 95% confidence interval (CI).19 A result of less than 5% is reported as the most robust method for confirming scale unidimensionality.17, 18 Rasch analysis was performed in this study by using RUMM2020 softwarea and SPSS software.b The unrestricted Rasch partial credit model was used to investigate overall model fit, item misfit, differential item functioning (DIF), and item thresholds. Participants were divided into 3 class intervals (ie, 3 groups of patients at different levels of ADL ability). Item misfit was considered if the chi-square or F statistic probability value was less than the Bonferroni-adjusted α value for multiple testing or the fit residuals were greater than ±2. DIF was investigated to identify items that operated differently for people of the same level of ability (ie, have the same amount of the underlying trait of interest) who share another feature (variable). DIF by age (65–70y, 80–86y, >87y) and sex were investigated and considered significant if the chi-square probability value was lower than the Bonferroni-adjusted P value. For both versions of the Barthel Index, item thresholds were also studied to investigate the existence of disordered thresholds (ie, response patterns for item categories that are not in the expected order). The person separation index was reported to provide an indication of the internal consistency (reliability) of the scale by examining the ability of the scale construct to discriminate among respondents. Sample-Size Calculations Linacre20 proposed that for most purposes a sample size of 150 (n range, 108–243) will provide 99% confidence of item calibration ±0.5 logits and a sample size of 100 (n range, 64–144) will provide 95% confidence within ±0.5 logits. However, the adequacy of test targeting influences sample-size requirements for Rasch analysis. For well-targeted tests, a sample size of 108 is likely to provide 99% confidence within ±0.5 logits.20 Conversely, for tests that are not well targeted, a larger sample size (eg, n=243) is required for similar item location precision.20 In this study, hospital admission data were planned for analysis for the original (n=236) and MBI (n=160). However, hospital discharge assessments were also planned for analysis to validate findings on an independent dataset. Preliminary examination of the range of participant scores in this study indicated that more than 10% of both samples would score within the minimal detectable change (≈10 points on the Barthel Index) of the highest possible Barthel Index score (ie, ≥90 points).12, 13, 14 However, in this study, the relationship under investigation was the proportion of persons that were unable to be located on the Barthel Index continuum after Rasch analysis. Each item location has an error estimate and surrounding 95% CI on the Rasch converted logit scale. A ceiling effect was therefore considered to exist if more than 10% of persons had a logit location higher than the upper band of the 95% CI of the standard error (SE) of measurement for the most difficult item. Results  In the trial reported by de Morton et al,13 the mean patient age was 78.9±7.5 years. Fifty-five percent were women, and the mean admission Barthel Index score (original version) was 67.3±26.1. In the trial reported by Jones et al,14 the mean patient age was 82.4±7.8 years, 57% were women, and the mean admission Barthel Index score (modified version) was 63.8±23.1. At both hospital sites, admission Barthel Index scores ranged from 0 to 100 out of a maximum possible score of 100. Barthel Index data were obtained for 71% (167/236) and 79% (126/160) of patients before discharge in the trial reported by de Morton and Jones, respectively. At both hospital sites, there were no significant differences for age, sex, prior place of residence, or admission activity limitation scores between patients who were and were not lost to follow-up. Rasch Analysis of the Original and Modified Barthel Index For the original Barthel Index, there were 236 admission assessments performed, and there were 40 extreme-person responses. These persons scored either the lowest (n=2) or highest (n=38) possible score on the Barthel Index and, therefore, have a theoretical ability to represent either −∞ or ∞ on the Rasch logit scale. A total-item chi-square test for item-trait interaction indicated that original Barthel Index data did not fit the Rasch model (χ202=94.17, P=.00). For the MBI, there were 160 admission assessments performed, and there were 3 extreme-person responses. A total-item chi-square test for item-trait interaction indicated that the MBI data did not fit the Rasch model (χ202=153.21, P=.00). A significant likelihood ratio test (χ262=199.15, P=.00) combined with visual inspection of the item-threshold categories justified the use of the unrestricted Rasch partial credit model for the MBI. The person separation index for both versions of the Barthel Index was high, .91 for the original Barthel Index and .94 for the MBI. This indicates a high ability of the Barthel Index to discriminate among respondents. Both versions of the Barthel Index showed evidence of multidimensionality. For the original version of the Barthel Index, the percentage of individual t tests outside the acceptable range was 12.24% (95% CI, 9–15), and for the modified version, it was 29.94% (95% CI, 27–33). Item Misfit Many items for both versions of the Barthel Index showed misfit to the Rasch model. Table 2 summarizes the results of Rasch analysis in this study. The bladder item for the original Barthel Index and the bladder and bowel items for the MBI had fit residuals of greater than 2, indicating the likely measurement of another construct. Multiple items for each version of the Barthel Index had fit residuals that exceeded −2, indicating item redundancy and that these items over discriminate. Items in both versions of the Barthel Index also had a significant chi-square and/or F statistic probability value that indicated that the observed proportions deviated significantly from the Rasch model on the basis of person scores and/or class intervals. | | |  | | Admission Data | Admission Data After Rescoring Disordered Thresholds |  |
|---|
 | Original Barthel Index | MBI | Original Barthel Index | MBI |  |
|---|
 | n | 236 | 160 | 236 | 160 |  |  | Extreme persons | 40 | 3 | 44 | 3 |  |  | Fit to the Rasch model (df=20) | χ2=94.17, P=.00 | χ2=153.21, P=.00 | χ2=53.06, P=.00 | χ2=161.06, P=.00 |  |  | Misfitting items | | | | |  |  | High positive fit residual | Bladder (2.01) Bowel (2.63) | >Bladder (4.28) | >None | >Bladder (4.14) |  |  | High negative fit residuals | Transfers (−3.21) Toilet use (−2.01) | Toilet use (−3.92) Bathing (−3.13) Dressing (−2.63) Walking (−2.01) | Toilet use (−2.81) Dressing (−2.22) Transfers (−2.13) | Toilet use (−3.85) Bathing (−3.04) Dressing (2.52) |  |  | Significant χ2P | Bowels (P=.02) Transfers (P=.00) Bladder (P=.00) Walking (P=.00) | Toilet use (P=.00) Bladder (P=.00) | Bladder (P=.00) | Toilet use (P=.00) Bladder (P=.00) |  |  | Significant F statistic P | Transfers (P=.00) Toilet use (P=.00) Walking (P=.00) | Toilet use (P=.00) Bathing (P=.00) Dressing (P=.00) Walking (P=.00) Transfers (P=.00) Bladder (P=.00) | Toilet use (P=.00) | Toilet use (P=.00) Bathing (P=.00) Dressing (P=.00) Bladder (P=.00) |  |  | Disordered thresholds | Feeding, stairs, bowels, walking | Stairs, bowels | None | None |  |  | DIF | None | Toilet use: uniform DIF by age (F=10.57, P=.00) | None | Toilet use: uniform DIF by age (F=10.32, P=.00) |  |  | Unidimensionality (independent t test) (%) | 12.24 (9–15) | 29.94 (27–33) | 6.25 (3–9) | 15.29 (12–19) |  | | | |
No DIF was identified for the original version of the Barthel Index. The toilet-use item showed significant uniform DIF by age for the MBI (F=10.57, P=.00). Persons in the youngest category (range, 65–79y) had a significantly higher probability of independence for this item compared with those in the middle age group (range, 80–86y). Exploratory Rasch analysis showed that original Barthel Index data did not fit the Rasch model after the misfitting bladder item had been removed (χ182=69.66, P=.00) or the modified version of the Barthel Index after the bladder and bowel items had been removed (χ162=32.4, P=.01). However, removal of these continence items reduced the item-trait chi-square value for both versions of the Barthel Index and indicated improvement in the fit of the data to the Rasch model. Disordered Thresholds Items from both versions of the Barthel Index had disordered thresholds. For the original version, these items were feeding, stairs, bowels, and walking. There was no person ability level at which it was most likely that assistance was required for feeding, stairs, and bowels. For walking, there was no person ability level at which the most likely response was to be independent with a wheelchair. Therefore, these items were rescored to remove disordered thresholds. There were no persons who were “independent with their wheelchair mobility,” so this category was combined with “unable to walk.” “Assistance with” feeding, stairs, and bowels was combined with “unable” to perform these items. For the MBI, 2 items, stairs and bowel control, had disordered thresholds. There was no person ability level at which it was most likely that “substantial assistance” is required for ascending and descending stairs. Similarly, there was no person ability level at which the most likely response was to require “moderate help” with bowel continence. To remove disordered thresholds, the stairs and bowel items were rescored by combining the 3 assistance response options. Rasch Analysis After Rescoring Disordered Thresholds Fit of the data to the Rasch model was not achieved by rescoring disordered thresholds for the original (χ202=161.06, P=.00) or modified (χ202=74.95, P=.00) versions of the Barthel Index. The item-trait interaction chi-square value for the original version of the Barthel Index was reduced by rescoring disordered thresholds and indicated an improvement in the fit of the data to the Rasch model. In contrast, the chi-square value for the modified version of the Barthel Index increased and indicated poorer fit of the data to the Rasch model after the rescoring of disordered thresholds. After rescoring disordered thresholds, the bladder item continued to display a high positive fit residual for the MBI but not for the original Barthel Index. No items had fit residuals greater than 2 for the original version of the Barthel Index. Many items continued to show misfit to the Rasch model for both versions of the Barthel Index with high negative fit residuals (indicating item redundancy) and significant chi-square and/or F statistic probability values (see table 2). The same item, toilet use, continued to show uniform DIF by age for the MBI. Rescoring the disordered thresholds resulted in a reduction in the percentage of individual t tests outside the acceptable range (see table 2). This indicates an improvement in the unidimensionality of both versions of the Barthel Index after the rescoring of the disordered thresholds. Nevertheless, the percentage of individual t tests remained greater than 5% for both versions. The examination of the residual correlation matrix indicated negative correlations of greater than 0.3 between the bowel and stairs items (r=−.31) for the original Barthel Index and the bladder control item with bathing (r=−.36), toilet (r=−.44), stairs (r=.33), dressing (r=−.45), and walking (r=−.36) items for the MBI. For the MBI, these findings support the high positive fit residual identified for the bladder item (ie, the likely measurement of another construct). A positive correlation of greater than 0.3 was only identified between the walking and transfers (r=.36) items for the MBI and indicates response dependency between these mobility-assessment tasks (ie, violation of the assumption of local dependence). The removal of the misfitting bladder item from the MBI (after rescoring disordered thresholds) did not result in the data fitting the Rasch model (χ182=49.8, P=.00), and the percentage of t tests outside the acceptable range also remained greater than 5% (12.74%; 95% CI, 9–16). However, the reduced item-trait chi-square value indicated improved fit of the data to the Rasch model. Person-Item Threshold Map After Rescoring Disordered Thresholds Person ability and item thresholds are shown on the same logit scale in the person-threshold map for the original Barthel Index and the MBI in Fig 1, Fig 2, respectively. Higher person ability and higher item difficulty are located to the right of 0. Person ability is spread above and below the range of measurement captured by both versions of the scale. At hospital admission, there were 16.1% of persons who scored above the upper band of the 95% CI for the SE of measurement for the most difficult item, stairs, on the original version of the Barthel Index and 5.6% for the MBI. For both versions of the Barthel Index, only a small number of persons were unable to complete an item, and, therefore, a floor effect was not identified. After item rescoring, the person separation index was .92 and .94 for the original version and the MBI, respectively. Validation in the Discharge Dataset The examination of the discharge dataset for the original (n=167) and modified (n=126) Barthel Index validated the findings from the analysis of the hospital admission dataset. Barthel Index data did not fit the Rasch model for the original (χ202=37.56, P=.01, n=119) or modified version of the Barthel Index (χ202=78.61, P=.00, n=112). However, the bladder item only showed misfit for the MBI with a high positive fit residual of 3.93. The walking and stairs items had disordered thresholds for the original and MBI, respectively. After rescoring disordered thresholds in the discharge dataset, the original Barthel Index data demonstrated fit to the Rasch model (χ202=27.14, P=.13), and for the percentage of individual t tests outside the acceptable range, the lower band of the 95% CI overlapped 5% (5.04%; 95% CI, 1–9). Despite the original Barthel Index satisfying these strict unidimensionality requirements, 2 items showed mild misfit to the model. The toilet use (−2.3) and dressing (−2.16) items had high negative fit residuals, indicating item redundancy. However, had a cut point of 2.5 been selected for item fit residuals in this study (instead of 2), these items would not have been identified as misfitting. Fit to the model was also identified in this dataset in the item (mean ± SD, −0.4±1.25) and person (mean, −.30±.58) fit residual statistics. After rescoring disordered thresholds, the MBI data did not fit the Rasch model in the discharge dataset (χ202=76.36, P=.00), and the bladder item continued to show misfit with a high positive fit residual (+3.56). The percentage of individual t tests outside the acceptable range was greater than 5% (18.85%; 95% CI, 15–23). The proportion of patients scoring above the SE of measurement for the most difficult item was 45.9% and 13.5% at hospital discharge for the original and modified versions, respectively. Item Hierarchy for the Original Compared With the MBI Figure 3 compares the item hierarchy for the original and modified versions of the Barthel Index based on hospital admission data after rescoring disordered item thresholds. Discussion  The Barthel Index is widely used as a method for measuring and monitoring changes in activity limitation for older medical patients in the acute care setting.21, 22 However, the results of Rasch analysis in this study indicate that the Barthel Index is not a unidimensional measure of ADL function for this patient population. More than half of the items in both the original and modified versions of the Barthel Index showed misfit to the Rasch model. Therefore, summation of Barthel Index items to produce a total ADL function score is not valid in this patient population. Lack of fit of data to the Rasch model also indicates that Barthel Index raw scores cannot be transformed to interval level data. In some parts of the Barthel Index, it is easier to register a change in Barthel Index score compared with other parts of the scale. A boundary effect is a common feature of ordinal scales in which it is more difficult to achieve a score change at the scale extremes. In this study, the MBI showed increased difficulty to achieve a change score at the upper-scale extreme, and the original version of the Barthel Index showed increased difficulty to achieve a change score at the upper- and lower-scale extremes.12 Other studies of the Barthel Index have reported a lack of fit of the data to the Rasch model because of the continence items compromising unidimensionality.6, 8, 11 In the current study, although many Barthel Index items showed misfit to the Rasch model, only the bowel and/or bladder items had fit residuals greater than 2, indicating the likely measurement of another construct for these items. Conceptually, the continence items appear different to the other items. The World Health Organization International Classification of Functioning, Disability and Health23 classifies incontinence as impairments of body function compared with the other items, which it classifies as activity limitations. Rasch analysis in this study has confirmed that the bladder item for the original Barthel Index and the bladder and bowel items for the MBI do not measure the same underlying construct as the other items for older acute medical patients. However, in contrast to the results reported by van Hartingsveld et al,11 after removing the relevant misfitting continence items in this study, neither the original nor modified versions of the Barthel Index fitted the Rasch model. The targeting of person ability and item difficulty was shown to be poorly matched for both versions of the Barthel Index for older acute medical patients. The range of person abilities exceeded the range of item difficulty at both ends of the scale. In particular, both versions of the Barthel Index had a ceiling effect in which a large proportion of patients were unable to be located on the Rasch-converted Barthel Index scale. The original Barthel Index had a ceiling effect at hospital admission, and the original and MBI had a ceiling effect at hospital discharge. As patient health improved across acute hospital length of stay, both versions of the Barthel Index had a greater proportion of patients unable to be located on the Barthel Index after Rasch analysis. Some authors have proposed using 8-,11 5-,24 and 3-item25 versions of the Barthel Index, but the ceiling effect would remain if applied in an older acute general medical patient population. This research indicates that neither the original nor the MBI are unidimensional scales. An exception to this occurred when the original Barthel Index was rescored and only then for discharge and not for admission Barthel Index scores. Because clinicians do not typically rescore outcomes obtained by using the Barthel Index, these findings, combined with the unacceptable ceiling effects of either version, render the Barthel Index an assessment tool with limited utility for monitoring patient health. The item hierarchy was similar between the original Barthel Index and the MBI in this study (see fig 3). For both versions of the Barthel Index, feeding and bowel continence were ranked as the easiest items. Stair climbing, bathing, and ambulation were ranked as the 3 most difficult items for both versions. For the 5 mid-range items, the hierarchical ordering of these items varied across the differing versions of the Barthel Index. Differences in item scoring and protocol between the original and modified versions of the Barthel Index are likely to explain some of the differences in hierarchy between versions. Only 1 previous study8 provided Barthel Index item location estimates with accompanying 95% CI data in the published report. In this previous study, MBI data were Rasch analyzed for 50 stroke and 50 spinal cord injury (SCI) patients. Item difficulty for the MBI in the current study showed greater similarity with the item difficulty for the SCI population than for the stroke population. Compared with the MBI in the current study, item location 95% confidence bands did not overlap for 6 items (feeding, bowel, transfers, bladder, dressing, stairs) in the stroke population and 5 items (bowel, transfers, toilet use, walking, stairs) in the SCI population. Disparity between reports may be explained by sample, cultural, or diagnostic differences, the translation of the Barthel Index to Turkish, or the modest sample size (50 stroke, 50 SCI patients) available for Rasch analysis in the previous study.8 However, regardless of the reason, the Barthel Index items appear to perform variably across differing patient groups. Study Limitations A limitation of this study was that the proportion of patients for whom the Barthel Index was completed by others (ie, nurse or family member) was not recorded. Another possible limitation is that there was a greater than 15% loss to follow-up for discharge destination data for the original and Modified Barthel Index. Therefore, a smaller sample size was available for Rasch analysis in the discharge compared with the admission dataset for both the original and Modified Barthel Index. The influence of these factors on the results of Rasch analysis in this study is not known. Conclusions  The results of Rasch analysis in this study indicate that the Barthel Index is not a unidimensional measure of ADL function for older acute medical patients, and, therefore, the summation of Barthel Index item scores is not valid in this patient population. In addition, many older acute medical patients have modest limitations in their ADL function, and, therefore, the Barthel Index does not have adequate scale width to accurately monitor changes in ability for these patients. An improved method for accurately measuring and monitoring changes in activity limitation for older acute medical patients is required. Suppliers Acknowledgment  We acknowledge the support of the Clinical Epidemiology and Health Service Evaluation Unit, Melbourne Health who kindly provided their data for secondary analysis in this study. References  1. 1Mahoney F, Barthel D. Functional evaluation: the Barthel Index. Md State Med J. 1965;14:61–65. MEDLINE 2. 2Wade D. Measurement in neurological rehabilitation. Oxford: Oxford Univ Pr; 1992;. 3. 3Royal College of Physicians. Standardised assessment scales for elderly people (Report of joint workshops of the Research Unit of the Royal College of Physicians and the British Geriatric Society). London: RCP; 1992;. 4. 4Shah S. In praise of the biometric and psychometric qualities of the Barthel Index. Physiotherapy. 1994;80:769–771. 5. 5Hsieh CL, Hsueh IP. A cross-validation of the comprehensive assessment of activities of daily living after stroke. Scand J Rehabil Med. 1999;31:83–88. MEDLINE |
CrossRef
6. 6Tennant A, Geddes J, Chamberlain M. The Barthel Index: an ordinal score or interval level measurement?. Clin Rehabil. 1996;10:301–308.
CrossRef
7. 7Barer D, Murphy J. Scaling the Barthel: a 10-point hierarchial version of the activities of daily living index for use with stroke patients. Clin Rehabil. 1993;7:271–277.
CrossRef
8. 8Kucukdeveci A, Yavuzer G, Tennant A, Suldur N, Sonel B, Arasil T. Adaption of the modified Barthel Index for use in physical medicine and rehabilitation in Turkey. Scand J Rehabil Med. 2000;32:87–92. MEDLINE |
CrossRef
9. 9Waugh R. An analysis of dimensionality using factor analysis (true score theory) and Rasch measurement: what is the difference? (Which method is better?). J Appl Meas. 2005;6:80–99. MEDLINE 10. 10Wright B, Stone M. Best test design. Chicago: Mesa Pr; 1979;. 11. 11van Hartingsveld F, Lucas C, Kwakkel G, Lindeboom R. Improved interpretation of stroke trial results using empirical Barthel item weights. Stroke. 2006;37:162–166.
CrossRef
12. 12de Morton NA, Jones CT, Keating JL, et al. The effect of exercise on outcomes for hospitalised older acute medical patients: an individual patient data meta-analysis. Age Ageing. 2007;36:219–222. MEDLINE 13. 13de Morton NA, Keating JL, Berlowitz DJ, Jackson B, Lim WK. Additional exercise does not change hospital or patient outcomes in older medical patients: a controlled clinical trial. Aust J Physiother. 2007;53:105–111. MEDLINE 14. 14Jones C, Lowe A, Tweddle N, McGregor L, Russell D, Brandt C. A randomised controlled trial of an exercise intervention to reduce functional decline and health service utilization in the hospitalized elderly. Aust J Ageing. 2006;25:126–133. 15. 15Shah S, Vanclay F, Cooper B. Improving the sensitivity of the Barthel Index for stroke rehabilitation. J Clin Epidemiol. 1989;42:703–709. MEDLINE |
CrossRef
16. 16Smith E. Effect of item redundancy on Rasch item and person estimates. J Appl Meas. 2005;6:147–163. MEDLINE 17. 17Smith E. Detecting and evaluating the impact of multidimensionality using item fit statistics and principal components analysis of residuals. J Appl Meas. 2002;3:205–231. MEDLINE 18. 18Tennant A, Pallant J. Unidimensionality matters! (A tale of two Smiths?). Rasch Meas Trans. 2006;20:1048–1051. 19. 19Binomial calculator. http://home.clara.net/sisa/binomial.htm. 20. 20Linacre J. Sample size and item calibration stability. Rasch Mes Trans. 1994;7:328. 21. 21de Morton N, Keating J, Jeffs K. Exercise for acutely hospitalised older medical patients. Cochrane Database Syst Rev. 2007;(1):. 22. 22de Morton NA, Keating JL, Jeffs K. The effect of exercise on outcomes for older acute medical inpatients compared with control or alternative treatments: a systematic review of randomized controlled trials. Clin Rehabil. 2007;21:3–16. MEDLINE |
CrossRef
23. 23World Health Organization. International classification of functioning, disability and health. Geneva: WHO; 2001;. 24. 24Hobart J, Thompson A. The five item Barthel Index. J Neurol Neurosurg Psychiatry. 2001;71:225–230. MEDLINE |
CrossRef
25. 25Ellul J, Watkins C, Barer D. Estimating total Barthel scores from just three items: the European Stroke database ‘minimum dataset’ for assessing functional status at discharge from hospital. Age Ageing. 1998;27:115–122. MEDLINE |
CrossRef
a Department of Physiotherapy, Monash University, Victoria, Australia b School of Physiotherapy, La Trobe University, Victoria, Australia. Reprint requests to Natalie A. de Morton, PhD, Dept of Physiotherapy, School of Primary Health Care, Faculty of Medicine, Nursing and Health Sciences, Monash University-Peninsula Campus, PO Box 527, Frankston, Victoria, Australia 3199
Supported by the National Health and Medical Research Council of Australia (Dora Lush Postgraduate Scholarship no. 280632). No commercial party having a direct financial interest in the results of the research supporting this article has or will confer a benefit upon the authors or upon any organization with which the authors are associated. PII: S0003-9993(08)00027-0 doi:10.1016/j.apmr.2007.10.021 © 2008 American Congress of Rehabilitation Medicine and the American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved. | |
|