| | Reliability and Agreement of 5 Ergo-Kit Functional Capacity Evaluation Lifting Tests in Subjects With Low Back PainAbstract Gouttebarge V, Wind H, Kuijer PP, Sluiter JK, Frings-Dresen MH. Reliability and agreement of 5 Ergo-Kit functional capacity evaluation lifting tests in subjects with low back pain. ObjectiveTo assess interrater reliability and agreement of 5 Ergo-Kit functional capacity evaluation lifting tests in subjects with low back pain (LBP). DesignWithin-subjects design, with 2 repeated measurements. SettingAcademic medical center in The Netherlands. ParticipantsTwenty-four subjects (10 men, 14 women) with LBP. InterventionsNot applicable. Main Outcome MeasureFive Ergo-Kit lifting tests (2 isometric, 3 dynamic) were assessed on 2 occasions (t1, t2), by 2 different raters (R1, R2). The interval between the test sessions was 3 days. Interrater reliability level was expressed with the intraclass correlation coefficient (ICC), and the level of agreement between raters with the standard error (SE) of measurement. ResultsICCs means (reliability) of isometric and dynamic Ergo-Kit lifting tests ranged from .94 to .97, and SE of measurement values (agreement) ranged from 1.9 to 8.6kg. ConclusionsThere was good reliability and agreement between raters of the isometric and dynamic Ergo-Kit lifting tests in subjects with LBP, which supports the use of these tests to assess functional lifting capacity. LOW BACK PAIN (LBP) is recognized as a major public health problem throughout the world. In fact, it is the most common and most costly musculoskeletal disorder in all industrialized countries.1, 2, 3, 4 The sickness-related absences and work disability claims resulting from LBP place a tremendous financial strain on patients and their communities.5, 6, 7, 8, 9, 10 Given this condition’s social impact and its financial ramifications for society, professionals in work disability and rehabilitation care need clinical instruments to accurately assess the functional capacity of LBP patients and thus enhance the effectiveness of the return to work process. Clinical instruments are principally used to measure relevant changes in people over time.11 The purpose of functional capacity evaluation (FCE) methods is to provide comprehensive, performance-based assessments that measure the current functional physical abilities of people with musculoskeletal complaints.12, 13, 14, 15, 16 In the Netherlands, the Ergo-Kit is an FCE method devised to report the functional physical capacity of workers. The Ergo-Kit relies on a battery of standardized tests that reflect work-related activities such as standing, walking, lifting, carrying, and reaching.17 As with any clinical test or instrument, the clinimetric properties of the Ergo-Kit must be defined before it can be legitimately applied for discriminative or evaluative purposes.18, 19, 20 Clinimetric properties, also referred to as psychometric properties,21 reflect the quality of clinical measurements and are crucial in selecting and using instruments—either for clinical practice or research.22, 23 Several studies have shown that despite their use in both evaluative and discriminative settings, there is little information currently available about the clinimetric properties of FCE methods (including the Ergo-Kit), such as their reproducibility, reliability, responsiveness, and validity.16, 24, 25, 26, 27, 28, 29 Reproducibility is a major quality indicator26, 30, 31 and relates to 2 concepts that are not always differentiated from each other: reliability and agreement.23 Reliability refers to the test’s ability to distinguish one subject from another despite any measurement errors. Agreement, on the other hand, concerns the absolute measurement error, as it evaluates how close the scores are in repeated measurements.23, 32 Reproducibility studies should address populations that are relevant to the implementation of tests or instruments in the field.33 The reliability of the Ergo-Kit isometric and dynamic lifting tests has been assessed in adults with no musculoskeletal complaints,34 but they should also be evaluated in subjects who do report these complaints. It is also important to establish interrater reliability and agreement to ensure adequate and meaningful interpretation of variations in the test measurements of different raters.35 In this study, we evaluated the reproducibility (ie, reliability and agreement between raters) of the Ergo-Kit isometric and dynamic lifting tests in subjects with LBP. Methods  Participants Fourteen physiotherapy (PT) centers in the southern section of Amsterdam were contacted for permission to recruit patients from their practices. All patients were initially contacted by their physiotherapists, who briefly explained the experimental procedures. The patients interested in participating received a folder containing detailed information on the study protocol and were asked to contact the first author (VG). Participant eligibility was determined through telephone interviews, during which potential subjects were asked several questions that were intended to determine whether the subjects met the 3 inclusion criteria: (1) age between 18 and 65 years, (2) had LBP in the last 3 months, and (3) because of LBP, had limited physical capacity in daily activities at home and at work. We defined LBP as 1 or more episodes of pain or stiffness in the low back area within the past 3 months that lasted for a minimum of 7 consecutive days. A power analysisa (confidence interval [CI] method with confidence level of .95, correlation coefficient sets at .90 and limit at .80) indicated that 23 subjects were required for the study. Prior to enrollment, subjects received verbal and written information on the study procedures and signed statements of informed consent. In addition, the subjects were free to withdraw from the study at any time. The study was performed in accordance with the Declaration of Helsinki and was approved by the Medical Ethics Committee of the Academic Medical Centre in Amsterdam. Ergo-Kit Tests: Selection, Description, and Outcomes The standard protocol of the Ergo-Kit assesses 55 subtests, and takes approximately 3 hours to complete. Of the 7 Ergo-Kit physical agility tests concerned with manipulation, balance, strength, and endurance tests that are associated with musculoskeletal complaints, 2 have been shown to be unreliable in adults without musculoskeletal complaints.34 Consequently, we used only 5 of the lifting tests in this study (fig 1). Two were isometric lifting tests: a back-torso lift test (BTLT) and a shoulder lift test (SLT). The other 3 were dynamic lifting tests: carrying lifting strength test (CLST), lower lifting strength test (LLST), and upper lifting strength test (ULST). Table 1 presents Ergo-Kit lifting test descriptions and outcomes. Standardized procedures were performed as described in the Ergo-Kit handbook.17 The Ergo-Kit protocol normally includes 2 tests on the Jamar hand dynamometer, 4 reach tests, and 5 manipulation tests between these 5 lifting tests. The testing order for the 5 lifting tests was not modified. | | |  | Ergo-Kit Tests | Description | Outcome |  |
|---|
 | Back-torso lift test (BTLT) Shoulder lift test (SLT) | Use of a “back and leg dynamometer” fixed on a platform, a chain and a handle. Handle is set at patella height for BTLT (fig 1A) and at elbow height for SLT (fig 1B). Maximal pulling during 4 s, 2 tries per test | Maximal isometric lift capacity (kg) |  |  | Carrying lifting strength test (CLST) Lower lifting strength test (LLST) Upper lifting strength test (ULST) | Use of a stand with two vertically adjustable shelves, a box with different weights and a step (20cm). Following standardized procedure, weight is added to the box (2.5, 5, 7.5 or 10 kg), depending on the subject’s coordination in the task, subject’s perception of the weight of the box, and subject complaints. 4−6 carries 5 m for CLST (fig 1C), 4−6 lifts from knuckle height to step for LLST (fig 1D) and 4−6 lifts from knuckle to acromion height for ULST (fig 1E). | Maximal safe weight for lifting (kg) |  | | | |
Raters A list of the 32 available raters in the Netherlands who were certified for Ergo-Kit assessment was obtained from the provider of this FCE method. All had completed the same training program, which consisted of 4 instruction days and at least 12 hours of practice. Because the test assessments were to take place in Amsterdam, selection was limited—for practical reasons—to raters who worked within a 40-km radius of the city. This left 3 raters, 2 of whom were selected at random and agreed to participate. Both raters (R1, R2) had between 4 and 5 years of experience performing the assessments. The raters received financial compensation and travel reimbursement for their participation. Procedure We used a within-subjects design to assess reliability and agreement. Each subject was assessed at 2 different times (t1, t2) by the 2 different raters (R1, R2). Raters assessed all subjects independently and were blinded to the other’s test results. The time interval between t1 and t2 was set at 3 days, as this was considered sufficient to prevent carry-over effects and to give subjects time to recover from the first assessment.36, 37, 38 In addition, each subject was assessed at the same time of the day.39 Subjects were divided into 2 groups, based on their availability, and the raters assessed both groups in counterbalanced order: 1 subject group was assessed at t1 by R1 and at t2 by R2, and 1 group at t1 by R2 and at t2 by R1. Prior to the second assessment, all subjects were asked whether they had recovered satisfactorily from the first assessment. If they had not, they were not allowed to undergo the second assessment, but were permitted to participate in 2 new test sessions at a later date. LBP: Pain Intensity and Disability Before both assessments, the patients were asked to complete an existing Dutch translation of the Von Korff questionnaire40 about their LBP and related disability. This was done to permit us to evaluate whether their health status had changed between t1 and t2. The Von Korff questionnaire has shown a moderate-to-good correlation with other self-reported disability instruments such as the Medical Outcome Study 36-Item Short-Form Health Survey and the Roland-Morris Disability Questionnaire; it has been evaluated as reliable and valid in study samples similar to the one in this study.41, 42 The Von Korff questionnaire assesses pain and disability experienced in the past 6 months; therefore in order to fit our inclusion criteria, we adjusted it to consider only a 3-month prevalence of LBP and disability. Current pain intensity was assessed with 3 questions that were scored on a scale of 0 (no pain) to 10 (the worst pain possible). Disability due to LBP was assessed with 4 questions about the number of days the subjects were disabled and their ability to perform activities and/or work (scored on a scale of 0 to 10). Two total scores were calculated: a 0 to 100 pain intensity score based on the mean of the pain intensity questions multiplied by 10, and a 0 to 100 disability score based on the mean of the disability questions multiplied by 10.40 Kinesiophobia and LBP Subjects were asked to fill in the Dutch version of the Tampa Scale of Kinesiophobia (TSK)43 to assess their fear of reinjury caused by physical movement and activity. The TSK covers 17 items, each of which is scored on a 4-point Likert scale ranging from “strongly disagree” to “strongly agree.” For each subject, a total score ranging from 17 to 68 was calculated after inversion of the individual scores for items 4, 8, 12, and 16. The TSK has shown good reliability and validity in different study populations.44, 45 The TSK was completed after each test session to avoid eventual effects on subjects’ performance provided by the assessment of this questionnaire. Data Analysis Means, standard deviations (SDs), and ranges were calculated for each test for raters 1 and 2. The level of reliability was expressed with an intraclass correlation coefficient (ICC)20, 46, 47 and determined with the test scores assessed by the 2 raters. We used the ICC model 2.1.A, based on a mixed 2-way analysis of variance, as defined by Shrout and Fleiss.48 The 95% CI was calculated for each ICC mean. The ICC and 95% CI values were evaluated as follows20, 26, 49, 50: “low” reliability when ICC means and/or CI lower bounds were lower than .50; “moderate” reliability when ICC means and/or CI lower bounds ranged from .50 to .80; and “high” reliability when ICC means and CI lower bounds were greater than .80. To assess the raters’ stability in repeated measurements over time and to gain an insight into the clinical relevance of the Ergo-Kit lifting tests, agreement was expressed with the standard error (SE) of measurement (SE of measurement = √[var(raters) + var(error)] or SE of measurement = SD × √[1 – ICC]) and its 95% CI (95% CI = 1.96 × SE of measurement).20, 33 Using a general linear model, we calculated 3 different components of variation, variance between subjects (var[subjects]), variance between raters (var[raters]), and variance due to measurement error (var[error]). To explore the stability of the patients’ health status from 1 test to another, their mean scores for LBP pain intensity and related disability (disability score), and kinesiophobia were calculated from the Von Korff questionnaire and TSK at t1 and t2. For these 3 variables, statistical differences between t1 and t2 were explored with paired t tests. All analyses were performed with the statistical analysis software SPSSb for Windows. Results  Participant Characteristics Twenty-five subjects with LBP (11 men, 14 women) were recruited for this study. All subjects were working either parttime or full-time in a variety of professions. One subject was not able to perform the second test assessment because he did not recover properly from the first test. The subjects’ mean age ± SD was 49±8 years (range, 34–63y), their mean height was 175cm (range, 158–195cm), and their mean body weight was 78kg (range, 48–97kg). There were few differences between t1 and t2 in terms of the subjects’ LBP pain intensity (P=.003) and related disability, and their subjects’ TSK mean scores (table 2). These small differences in average pain intensity, average disability, and average TSK scores, however, do not appear to be clinical relevant changes within subjects from 1 test session to another. Reliability Table 3 presents the averages, SDs, and ranges in scores for all 5 Ergo-Kit tests for both sessions assessed by the raters, their mean ICCs, and corresponding 95% CIs. The level of interrater reliability was high for both isometric strength tests (BTLT, SLT), as their mean ICCs were .97 and .96, respectively, with CI lower bounds of .94 and .91, respectively. The mean ICCs for the 3 dynamic strength tests were .95 for the CLST and the ULST, and .94 for the LLST. The corresponding CI lower bounds (.84, .89, and .85, respectively) are considered highly reliable. | | |  | Tests (kg) | N | Rater 1 (s) | Rater 2 (s) | ICC | ICC 95% CI |  |
|---|
 | BTLT | 24 | 65.9±38.3 | 20.5–180.5 | 63.3±39.5 | 12.5–177.5 | .97 | .94–.99 |  |  | SLT | 24 | 37.6±18.3 | 14.0–72.0 | 38.9±19.1 | 10.0–76.5 | .96 | .91–.98 |  |  | CLST | 24 | 24.5±9.7 | 10.0–47.5 | 22.1±11.2 | 7.5–47.5 | .95 | .84–.98 |  |  | LLST | 24 | 23.8±11.1 | 7.5–47.5 | 21.8±10.6 | 7.5–47.5 | .94 | .85–.97 |  |  | ULST | 24 | 17.0±6.3 | 7.5–32.5 | 17.1±6.6 | 7.5–30.0 | .95 | .89–.98 |  | | | |
Variation Components The variation components (between subjects, between raters, systematic error) for all 5 tests are presented in table 4. Given the ratio between all 3 variation components in all 5 tests, var[raters] is relatively small, whereas var[subjects] is relatively high. Agreement Table 4 presents the SE of measurement and the 95% CI for each test, which offers a clear picture of the agreement between the raters. The SE of measurements, expressed in kilograms, are small, especially given the mean values of the different tests (see table 3). For instance, the BTLT mean score from both test assessments approaches 64kg, its SE of measurement 8.6kg, and its CI 47 to 81kg. This indicates that an increase or decrease of 17kg from the observed score cannot be interpreted as a change resulting from a measurement error. Discussion  Our purpose in this study was to evaluate the reliability and agreement between 2 raters of 5 Ergo-Kit lifting tests in subjects with LBP. For both of the isometric lifting tests, reliability between raters was considered high, a finding that is in line with other studies.51, 52, 53, 54 The 3 dynamic lifting tests were also found to be highly reliable. The ICC is an accepted measure of reliability when it comes to the discriminative capacity of a test. ICC values are sensitive to the heterogeneity of the study population: when measurement error variability is small compared to the performance variability between subjects, ICC values can be high, approaching 1, as they did in this study. The SDs in all 5 tests were high, showing significant variability in test scores between subjects. Our findings in this study are in line with reliability studies of other types of lifting tests,14, 27, 52, 55, 56, 57 but especially of another FCE method, the Isernhagen Work System (IWS).29, 58, 59, 60, 61, 62 The methods differ in their design (material used and needed) and assessment method (step-by-step test protocol to get the end point of the lifting tests). The IWS uses a kinesiophysical approach, relying on the therapist’s expertise (observations) to determine maximum lifting capacity rather than on patient reports (pain, discomfort), while the Ergo-Kit is based on both the therapist’s expertise and patient reports. It is possible, however, to draw comparisons of studies of both lifting tests because dynamic lifting capacity seems to be the construct being measured in both Ergo-Kit and IWS lifting tests. Gross and Battie,58 Brouwer et al,59 and Reneman et al60 performed their studies also with subjects with LBP, quantified their outcomes with an ICC as well, and found high levels of intra- and interrater reliability of IWS dynamic lifting tests (ICC range, .75–.98). So, because dynamic lifting tests from both Ergo-Kit and IWS FCE methods are reliable, it would be relevant to assess these lifting tests concurrently with the same subjects to obtain an appropriate insight into whether they could be used interchangeably. This study is the first to evaluate agreement between FCE tests. For agreement, different statistics are commonly used, such as the Bland-Altman visual plotting method,63 smallest real difference,64 and SE of measurement. As variations between raters were nearly nil (see table 4), it can be concluded that the variations in test scores between both assessments were not the result of disagreement between the 2 raters, but rather to performance variations within subjects. Because the Ergo-Kit tests are used in PT settings for evaluative purposes, reproducibility and responsiveness are 2 major properties that need to be evaluated.11 In this study, we examined the reproducibility of the tests by calculating the SEs of measurement and CIs. Some suggestions about the responsiveness of these tests may also be made, however. For instance, the Ergo-Kit tests should be able to detect clinically relevant changes within subjects during repeated evaluations throughout a rehabilitation program. Safe amount of minimal change that has to be found to conclude that change in subjects’ performance is due to a real change and not to measurement error, may be found by checking the SEs of measurement and CIs. As in other studies,65, 66 SEs of measurement could be expressed as a percentage of the mean test score (ie, at t1: BTLT, 13.1%; SLT, 13.3%; CLST, 13.9%; LLST, 15.5%; ULST, 11.2%). Because no similar data of the SE of measurement and SE of measurement percentage for FCE lifting tests have been previously reported, the set-up of an SE of measurement’s cutoff value for clinical relevance cannot be retrieved from literature, and should be based on the practitioner’s knowledge of the lifting tests. The SE of measurement percentage values we found in this study were lower than the ones found for isokinetic strength tests,65, 66 which suggests a sufficient agreement level of the Ergo-Kit lifting tests, and thus makes their clinical use legitimate. Conclusions  Our results suggest that the reproducibility (ie, reliability and agreement between raters) of 5 Ergo-Kit tests in subjects with LBP was good. Criterion-related and construct validity appear to be the topics that merit the most attention in future studies. Suppliers Acknowledgments  We thank Henrica C. de Vet, PhD, for her statistical advice, and the 2 raters who assessed the Ergo-Kit tests for their time and effort in participating in this study. References  1. 1Wyatt M, Underwood MR, Scheel IB, Cassidy JD, Nagel P. Back pain and health policy research: the what, why, how, who, and when. Spine. 2004;29:E468–E475.
CrossRef
2. 2Liddle SD, Baxter GD, Gracey JH. Exercise and chronic low back pain: what works?. Pain. 2004;107:176–190. Abstract | Full Text |
Full-Text PDF (263 KB)
|
CrossRef
3. 3Kääriä S, Kaila-Kangas L, Kirjonen J, Riihimaki H, Luukkonen R, Leino-Arjas P. Low back pain, work absenteeism, chronic back disorders, and clinical findings in the low back as predictors of hospitalization due to low back disorders: a 28-year follow-up of industrial employees. Spine. 2005;30:1211–1218.
CrossRef
4. 4Speed C. Low back pain. BMJ. 2005;328:1119–1121. 5. 5Elders LA, Burdorf A. Interrelations of risk factors and low back pain in scaffolders. Occup Environ Med. 2001;58:597–603. MEDLINE |
CrossRef
6. 6IJzelenberg W, Burdorf A. Impact of musculoskeletal co-morbidity of neck and upper extremities on healthcare utilisation and sickness absence for low back pain. Occup Environ Med. 2004;61:806–810.
CrossRef
7. 7Institute for Employee Benefit Schemes UWV. [Statistical information on medical classification in work disability claim 1999] [Dutch]. Amsterdam: UWV; 2001;. 8. 8Central Bureau of Statistics. Available at: http://statline.cbs.nl/StatWeb/Start.asp?lp=Search/Search&LA=EN&DM=SLEN. Accessed June 7, 2006. 9. 9Picavet HS. [Multimedia campaign on low back pain prevention: potential health benefits] [Dutch]. Bilthoven: RIVM; 2004;. 10. 10Grabois M. Management of chronic low back pain. Am J Phys Med Rehabil. 2005;84(3 Suppl):S29–S41. MEDLINE 11. 11Guyatt G, Walter S, Norman G. Measuring change over time: assessing the usefulness of evaluative instruments. J Chronic Dis. 1987;40:171–178. MEDLINE |
CrossRef
12. 12King PM. Sourcebook of occupational rehabilitation. New York: Plenum Pr; 1998;. 13. 13Strong S. Functional Capacity Evaluation: the good, the bad and the ugly. Occup Ther Now. 2002;4:5–9. 14. 14Tuckwell NL, Straker L, Barrett TE. Test-retest reliability on nine tasks of the Physical Work Performance Evaluation. Work. 2002;19:243–253. MEDLINE 15. 15Vasudevan SV. Role of functional capacity assessment in disability evaluation. J Back Musculoskeletal Rehabil. 1996;6:237–248. 16. 16Strong S, Baptiste S, Cole D, Costa M. Functional assessment of injured workers: a profile of assessor practices. Can J Occup Ther. 2004;71:13–23. MEDLINE 17. 17[Ergo-Kit for functional capacity evaluation: user manual] [Dutch]. Enschede: Ergo Control; 2002. 18. 18Matheson LN, Mooney V, Grant JE, Leggett S, Kenny K. Standardized evaluation of work capacity. J Back Musculoskeletal Rehabil. 1996;6:249–264. 19. 19Mooney V. Functional capacity evaluation. Orthopedics. 2002;25:1094–1099. MEDLINE 20. 20Portney LG, Watkins MP. Foundations of clinical research: applications to practice. 2nd ed.. Upper Saddle River: Prentice-Hall; 2000;. 21. 21Streiner DL. Clinimetrics vs. psychometrics: an unnecessary distinction. J Clin Epidemiol. 2003;56:1142–1145. Abstract | Full Text |
Full-Text PDF (79 KB)
|
CrossRef
22. 22Feinstein AR. Clinimetrics. New Haven: Yale Univ Pr; 1987;. 23. 23de Vet HC, Terwee CB, Bouter LM. Current challenges in clinimetrics. J Clin Epidemiol. 2003;56:1137–1141. Abstract | Full Text |
Full-Text PDF (85 KB)
|
CrossRef
24. 24Wind H, Gouttebarge V, Kuijer PP, Frings-Dresen MH. Assessment of functional capacity of the musculoskeletal system in the context of work, daily living, and sport: a systematic review. J Occup Rehabil. 2005;15:253–272. MEDLINE |
CrossRef
25. 25Innes E, Straker L. Validity of work-related assessments. Work. 1999;13:125–152. 26. 26Innes E, Straker L. Reliability of work-related assessments. Work. 1999;13:107–124. 27. 27Gardener L, McKenna K. Reliability of occupational therapists in determining safe, maximal lifting capacity. Aust Occup Ther J. 1999;46:110–119. 28. 28Strong S, Baptiste S, Clarke J, Cole D, Costa M. Use of functional capacity evaluations in workplaces and the compensation system: a report on workers’ and report users’ perceptions. Work. 2004;23:67–77. MEDLINE 29. 29Gouttebarge V, Wind H, Kuijer PP, Frings-Dresen MH. Reliability and validity of Functional Capacity Evaluation methods: a systematic review with reference to Blankenship system, Ergos work simulator, Ergo-Kit and Isernhagen work system. Int Arch Occup Environ Health. 2004;77:527–537. MEDLINE |
CrossRef
30. 30Hart DL, Isernhagen SJ, Matheson LN. Guidelines for functional capacity evaluation of people with medical conditions. J Orthop Sports Phys Ther. 1993;18:682–686. MEDLINE 31. 31Nunnally JC. Psychometric theory. New York: McGraw-Hill; 1994;. 32. 32de Vet HC. Observer reliability and agreement. In: Armitag P, Colton T editor. Encyclopedia biostatistica. Vol 4:Chichester: John Wiley & Sons; 1998;p. 3123–3128. 33. 33Streiner DL, Norman GR. Health measurement scales: a practical guide to their development and use. New York: Oxford Univ Pr; 2003;. 34. 34Gouttebarge V, Wind H, Kuijer PP, Sluiter JK, Frings-Dresen MH. Intra- and interrater reliability of the Ergo-Kit FCE method in adults without musculoskeletal complaints. Arch Phys Med Rehabil. 2005;86:2354–2360. Abstract | Full Text |
Full-Text PDF (298 KB)
|
CrossRef
35. 35Mulsant BH, Kastango KB, Rosen J, Stone RA, Mazumdar S, Pollock BG. Interrater reliability in clinical trials of depressive disorders. Am J Psychiatry. 2002;159:1598–1600.
CrossRef
36. 36Marx RG, Menezes A, Horovitz L, Jones EC, Warren RF. A comparison of two time intervals for test-retest reliability of health status instruments. J Clin Epidemiol. 2003;56:730–735. Abstract | Full Text |
Full-Text PDF (463 KB)
|
CrossRef
37. 37Ästrand P, Rodahl K, Dahl HA, Stromme SB. Textbook of work physiology: physiological bases of exercise. Champaign: Human Kinetics; 2003;. 38. 38Wilmore JH, Costill DL. Physiology of sport and exercise. Champaign: Human Kinetics; 1999;. 39. 39Boadella JM, Sluiter JK, Frings-Dresen MH. Reliability of upper extremity tests measured by the Ergos Work Simulator: a pilot study. J Occup Rehabil. 2003;13:219–232. MEDLINE |
CrossRef
40. 40Von Korff M, Ormel J, Keefe FJ, Dworkin SF. Grading the severity of chronic pain. Pain. 1992;50:133–149. Abstract |
Full-Text PDF (2161 KB)
|
CrossRef
41. 41Underwood MR, Barnett AG, Vickers MR. Evaluation of two time-specific back pain outcome measures. Spine. 1999;24:1104–1112. MEDLINE |
CrossRef
42. 42Smith BH, Penny KI, Purves AM, et al. The Chronic Pain Grade questionnaire: validation and reliability in postal research. Pain. 1997;71:141–147. Abstract | Full Text |
Full-Text PDF (364 KB)
|
CrossRef
43. 43Vlaeyen JW, Kole-Snijders AM, Rotteveel AM, Ruesink R, Heuts PH. The role of fear of movement/(re)injury in pain disability. J Occup Rehabil. 1995;5:235–252.
CrossRef
44. 44Roelofs J, Goubert L, Peters ML, Vlaeyen JW, Crombez G. The Tampa Scale for Kinesiophobia: further examination of psychometric properties in patients with chronic low back pain and fibromyalgia. Eur J Pain. 2004;8:495–502. Abstract | Full Text |
Full-Text PDF (190 KB)
|
CrossRef
45. 45Swinkels-Meewisse EJ, Swinkels RA, Verbeek AL, Vlaeyen JW, Oostendorp RA. Psychometric properties of the Tampa Scale for kinesiophobia and the fear-avoidance beliefs questionnaire in acute low back pain. Man Ther. 2003;8:29–36. |
CrossRef
46. 46Fleiss JL. The design and analysis of clinical experiments. New York: Wiley Classics Library; 1986;. 47. 47Tinsley HE. Interrater reliability and agreement of subjective judgements. J Couns Psychol. 1975;22:358–376. 48. 48Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86:420–428.
CrossRef
49. 49Dimitrov D, Rumrill P, Fitzgerald S, Hennessey M. Reliability in rehabilitation measurement. Work. 2001;16:159–164. 50. 50Hripcsak G, Heitjan DF. Measuring agreement in medical informatics reliability studies. J Biomed Informs. 2002;35:99–110. 51. 51Essendrop M, Schibye B, Hansen K. Reliability of isometric muscle strength tests for the trunk, hands and shoulders. Int J Indus Ergon. 2001;28:379–387. 52. 52Horneij E, Holmström E, Hemborg B, Isberg PE, Ekdahl C. Inter-rater reliability and between-days repeatability of eight physical performance tests. Adv Physiother. 2002;4:146–160. 53. 53Perry J, Weiss WB, Burnfield JM, Gronley JK. The supine hip extensor manual muscle test: a reliability and validity study. Arch Phys Med Rehabil. 2004;85:1345–1350. Abstract | Full Text |
Full-Text PDF (194 KB)
|
CrossRef
54. 54Roy MA, Doherty TJ. Reliability of hand-held dynamometry in assessment of knee extensor strength after hip fracture. Am J Phys Med Rehabil. 2004;83:813–818. MEDLINE |
CrossRef
55. 55Lechner DE, Jackson JR, Roth DL, Straaton KV. Reliability and validity of a newly developed test of physical work performance. J Occup Med. 1994;36:997–1004. MEDLINE 56. 56Saunders RL, Beissner KL, McManis BG. Estimates of weight that subjects can lift frequently in functional capacity evaluations. Phys Ther. 1997;77:1717–1728. MEDLINE 57. 57Matheson LM, Mooney V, Grant JE, et al. A test to measure lift capacity of physical impaired adults (Part 1—Development and reliability testing). Spine. 1995;20:2119–2129. MEDLINE 58. 58Gross DP, Battie MC. Reliability of safe maximum lifting determinations of a functional capacity evaluation. Phys Ther. 2002;82:364–371. MEDLINE 59. 59Brouwer S, Reneman MF, Dijkstra PU, Groothoff JW, Schellekens JM, Goeken LN. Test-retest reliability of the Isernhagen Work Systems Functional Capacity Evaluation in patients with chronic low back pain. J Occup Rehabil. 2003;13:207–218. MEDLINE |
CrossRef
60. 60Reneman MF, Dijkstra PU, Westmaas M, Goeken LN. Test-retest reliability of lifting and carrying in a 2-day functional capacity evaluation. J Occup Rehabil. 2002;12:269–275. MEDLINE |
CrossRef
61. 61Isernhagen SJ, Hart DL, Matheson LM. Reliability of independent observer judgments of level of lift effort in a kinesiophysical functional capacity evaluation. Work. 1999;12:145–150. 62. 62Reneman MF, Jaegers SM, Westmaas M, Goeken LN. The reliability of determining effort level of lifting and carrying in a functional capacity evaluation. Work. 2002;18:23–27. MEDLINE 63. 63Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1:307–310.
CrossRef
64. 64Beckerman H, Roebroeck ME, Lankhorst GJ, Becher JG, Bezemer PD, Verbeek AL. Smallest real difference, a link between reproducibility and responsiveness. Qual Life Res. 2001;10:571–578. MEDLINE |
CrossRef
65. 65Clark DJ, Condliffe EG, Patten C. Reliability of concentric and eccentric torque during isokinetic knee extension in post-stroke hemiparesis. Clin Biomech (Bristol Avon). 2006;21:395–404. Abstract | Full Text |
Full-Text PDF (167 KB)
|
CrossRef
66. 66Gagnon D, Nadeau S, Gravel D, Robert J, Belanger D, Hilsenrath M. Reliability and validity of static knee strength measurements obtained with a chair-fixed dynamometer in subjects with hip or knee arthroplasty. Arch Phys Med Rehabil. 2005;86:1998–2008. Abstract | Full Text |
Full-Text PDF (359 KB)
|
CrossRef
Coronel Institute of Occupational Health, Academic Medical Center, Universiteit van Amsterdam, Amsterdam, The Netherlands. Reprint requests to Vincent Gouttebarge, MSc, Coronel Institute of Occupational Health, Academic Medical Center, Universiteit van Amsterdam, PO Box 22700, 1100 DE Amsterdam, The Netherlands
No commercial party having a direct financial interest in the results of the research supporting this article has or will confer a benefit upon the author(s) or upon any organization with which the author(s) is/are associated. PII: S0003-9993(06)00530-2 doi:10.1016/j.apmr.2006.05.028 © 2006 American Congress of Rehabilitation Medicine and the American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved. | |
|