| | Dimensions of Disordered Attention in Traumatic Brain Injury: Further Validation of the Moss Attention Rating ScaleAbstract Hart T, Whyte J, Millis S, Bode R, Malec J, Richardson RN, Hammond F. Dimensions of disordered attention in traumatic brain injury: further validation of the Moss Attention Rating Scale. ObjectivesTo investigate the factor structure of disordered attention in moderate to severe, acute traumatic brain injury (TBI) and to use factor analysis and item response theory to further validate and refine an observational rating scale of attention for clinical and research purposes. DesignMulticenter inception cohort. SettingInpatient rehabilitation units. ParticipantsPatients with TBI (N=372) consecutively admitted to 8 Traumatic Brain Injury Model System centers within 2 weeks prior to observation, who consistently followed commands and who were on stable doses of all psychotropic medications for a 3-day rating period. InterventionsNot applicable. Main Outcome MeasureParticipants were rated independently by treating occupational and physical therapists at an average of 1 month postinjury on the Moss Attention Rating Scale (MARS), a 45-item, Likert-type scale of attention-related behavior. ResultsExploratory and confirmatory factor analyses revealed 3 correlated factors of disordered attention, interpreted as restlessness/distractibility, initiation, and sustained/consistent attention. Item response (Rasch) analysis was used to eliminate redundant items and to fill gaps in item difficulty. The resulting MARS consists of 22 items that can produce 3 factor scores and a total score that covers the broad construct of disordered attention. ConclusionsThe factor-scored MARS has potential utility as a quantitative observational method with which to assess and study different dimensions of disordered attention in acute TBI, and to monitor change over time and treatment response within these dimensions.
TRAUMATIC BRAIN INJURY (TBI) is frequently followed by objective and subjective symptoms of disordered attention across the spectrum of injury severity. Problems with concentration, sustained attention, and multitasking are frequently endorsed by people with TBI, their families, and clinicians.1, 2 Previous work in our laboratory and in others has documented particular difficulties after TBI in speed of processing,3 sustained attention,4 vulnerability to distracting stimuli,5 and divided attention.6 The fact that all of these symptoms may be thought of as disordered attention speaks to the multifaceted nature of attention as a cognitive construct. In addition, recent cognitive experimental research and functional imaging studies support the concept of multiple distinct, but interactive, neural networks subserving various aspects of attention.7
Because of the importance of these problems for survivors of TBI, attention has been the target of many types of treatment, including psychoactive medications8 and cognitive retraining programs (for a recent review, see Riccio and French9). Attention is a focus of rehabilitation along the continuum of care in TBI, from inpatient rehabilitation to postacute or community re-entry settings.
Neuropsychologic tests can be useful for documenting attention deficits and treatment effects in the clinic, but they have several drawbacks. First, because of the multifaceted nature of attention, multiple neuropsychologic tests may be needed to provide a comprehensive picture, and this may require more time than is practical. Second, the highly controlled environment of psychometric testing may miss aspects of attention that are needed for real-world functioning, such as responding to unexpected events and deciding how to deploy one’s attention to cope with distractions—functions that may be impaired while more basic attentional mechanisms are preserved.10 Finally, psychometric testing may not be feasible with patients who are severely impaired and/or changing rapidly during the early phase of recovery. Previous studies have suggested that approximately 20% of hospitalized subjects with TBI are unable to participate in neuropsychologic testing because of issues such as agitation, aphonia, poor arousal, and motor impairment.11, 12
An alternative to neuropsychologic testing is the use of standardized behavioral observation methods. We have used such methods to show that, when compared with control subjects, people with TBI display more off-task behavior, accomplish less work,13 and detect and correct fewer of their own errors in naturalistic action tasks.14 Naturalistic behavioral measures have also demonstrated sensitivity to treatment with methylphenidate.8 Thus, observations of naturalistic behavior can be used to draw conclusions about attentional function, and this raises the possibility that a rating scale based on such observations could be a useful clinical assessment tool.
Following this reasoning, we developed a behavioral rating scale we named the Moss Attention Rating Scale (MARS) for use by rehabilitation clinicians to assess attentional disturbances in everyday environments. Previous efforts along these lines have included the use of several rating scales to measure attention deficit hyperactivity disorder15 and of a brief instrument that was used in a study of TBI.16 The MARS was based on extensive pilot work and has been subjected to initial reliability and validity analysis.17 For example, while there were minor differences in the scoring by occupational and physical therapists working with the same patient, there was good overall agreement between the disciplines. Rasch analysis showed a very reliable estimate of attention that also distinguished multiple strata on total scores, indicating that the MARS is sensitive to variations in levels of attention in the range of an acutely injured or moderately to severely impaired population.
In constructing the MARS, we hypothesized the presence of an overall dimension of attention, composed of more tightly correlated clusters of items representing attentional subfunctions.17 Our intent was to represent the varied facets of attention including arousal, alertness, and orienting; focused attention and internal and external distractibility; cognitive speed; sustained attention, vigilance, and persistence; working memory and attention span; shifting and dividing attention; and initiation, performance consistency, and the ability to mobilize and direct attentional resources. (Hemispatial attention was included in the first list of “factors” desired for MARS coverage, but was dropped because it relates to focal brain damage, whereas the others were expected to relate to the extent of diffuse damage.) Several models of attention have hypothesized such subcomponents, based on theory, neurophysiologic and psychopharmacologic research, clinical observations, and/or factor analytic studies.18, 19 An influential and widely replicated model of attention has been put forth by Mirsky et al20 that is based primarily on factor analytic studies of neuropsychologic test scores of people with psychiatric and neurologic disorders and of healthy children. Mirksy21 summarized the model as including 4 factors: focus/execute (this includes cognitive speed), sustain, shift, and encode; that is, hold information in mind long enough to perform a cognitive operation. A fifth element, stability, has also been included.21 The 4-factor model has been replicated in several investigations, including a confirmatory factor analysis (CFA) using a large sample of adolescents with different psychiatric disorders.22 The latter study showed that, compared with the 4-factor model, a slightly better fit was obtained with 3 factors: sustained attention/vigilance, encoding, and “complex sustained effort,” an amalgam of perceptual speed and cognitive flexibility.
Our initial Rasch analysis of the MARS supported the existence of a general attention dimension after 3 misfitting items were eliminated, and located the remaining items on this dimension.17 Because we had originally hypothesized an overall dimension of attention with more tightly clustered subconstructs, our purpose in the current study was to identify the factors or subdimensions of disordered attention in the early stages of TBI, as measured by the MARS. Given the presence of an overarching dimension of attention, however, we hypothesized that any observed factors would be correlated with one another. We did not intend to test an existing factor model or to constrain the number of factors that would manifest. Instead, we used both exploratory factor analysis (EFA) and CFA to examine the underlying structure of disordered attention in this population. While the results were expected to be of theoretical interest with regard to the structure of attention as a cognitive construct, we also had an important clinical purpose in the study. The factor analysis was planned in conjunction with additional Rasch analyses to hone the 45-item research version of the MARS into a scale that was not only shorter, but would also allow clinicians and researchers to calculate more than 1 score (eg, relevant factor scores as well as a total). Factor scores would be useful both for tracking specific aspects of clinical improvement and for measuring the impact of treatments that target particular aspects of attention.
Methods  Participants Participants were 372 inpatients with moderate to severe TBI, admitted for acute rehabilitation to 8 Traumatic Brain Injury Model System (TBIMS) centers over a 3-year period (May 1999−May 2002). Most of the participants (315, 85%) were enrolled in the TBIMS longitudinal database project supported by the National Institute on Disability and Rehabilitation Research.23 The remaining 57 participants, who were included from 3 of the collaborating centers, were not enrolled in the TBIMS database because they were treated acutely in a non-TBIMS trauma center before their admission to the TBIMS rehabilitation center. In other respects, they were comparable to the TBIMS sample, and met all of the following inclusion criteria. All participants had sustained a TBI as evidenced by loss of consciousness, focal brain lesion on neuroimaging, and/or abnormality on neurologic examination consistent with external trauma; all were at least 16 years of age. Further inclusion criteria stipulated a Rancho Los Amigos Level of Cognitive Functioning (RLA) score greater than III at the time of rating on the MARS, and stable dosing levels on all psychotropic medications (if any were prescribed) during the rating period. Because all participants were rated on the MARS by their treating occupational and physical therapists over a 3-day period (see below), patients who were otherwise eligible were excluded if they were not receiving both occupational therapy (OT) and physical therapy (PT) treatment, or if those treatments were given entirely in an unusual setting such as a private treatment room, or if there had been changes in their assigned OT or PT staff during the rating interval. We did not obtain separate informed consent for this study because it was subsumed under the overall TBIMS informed consent process. The 57 participants not included in the TBIMS were rated anonymously under an informed consent waiver on the condition that no information be recorded about them other than their MARS scores. Therefore, descriptive information about the sample as a whole could be derived only from the 315 TBIMS enrollees. The demographic and injury characteristics of the TBIMS study sample are summarized in table 1. Not all injury severity indices were available for all participants. The Glasgow Coma Scale (GCS)24 was scored on admission to the emergency department in the acute care hospital. Time to follow commands was defined as the interval in days between the TBI and the date the patient followed simple commands on 2 consecutive assessments within 24 hours. As is consistent with other samples of moderate to severe TBI, the majority of participants were male (70%), and most had been injured in motor vehicle collisions (also 70%). Admission GCS scores varied across the range of the scale, although the average score (8) indicated severe injury, as did the average time to follow commands greater than 1 week. Because the sample consisted of patients admitted for inpatient rehabilitation, cases with high GCS scores were likely to have had complications such as subdural hematoma, or to have sustained focal injury such as a gunshot wound to the brain. Participants were scored on the MARS from 8 to 99 days postinjury (mean, 33d). | | |  | Characteristics | Values |  |
 | Age at injury (y) | |  |
 | Mean ± SD | 37.0±18.3 |  |
 | Range | 16–90 |  |
 | Sex, n (%) | |  |
 | Male | 222 (70) |  |
 | Female | 93 (30) |  |
 | Race, n (%) | |  |
 | White | 222 (70) |  |
 | African American | 73 (23) |  |
 | Other ethnic minority | 20 (7) |  |
 | Etiology | |  |
 | Motor vehicle collision (includes pedestrian injuries), n (%) | 222 (70) |  |
 | Fall | 48 (15) |  |
 | Blunt assault | 21 (7) |  |
 | Gunshot wound | 8 (3) |  |
 | Sports/other | 14 (4) |  |
 | Missing/unknown | 2 (<1) |  |
 | GCS on emergency admission (n=265) | |  |
 | Mean ± SD | 8.7±4.2 |  |
 | Median | 8.0 |  |
 | Range | 3–15 |  |
 | Time to follow commands (d) (n=301) | |  |
 | Mean ± SD | 8.4±11.6 |  |
 | Range | 0.5–76.0 |  | | | |
Instrument The research version of the MARS is a 45-item, Likert-type rating scale; its development process has been previously described.17 (The full research version of the scale included 8 “control” or filler items not directly related to attention, for a total of 53 items. The filler items, however, were not analyzed in this study.) The response choices lie on a 5-point scale, ranging from “definitely true” to “definitely false” with an intermediate rating of “sometimes true, sometimes false.” Some items are written so that the “false” response choices indicate an attention deficit (eg, maintains eye gaze appropriate to task at hand), while for other items, “true” responses indicate an attention deficit (eg, persists with an activity or response after being told to stop). Procedure The study protocol called for patients to be rated on the MARS over a 3-day period independently by their treating occupational and physical therapists, with ratings to be completed between day 3 and day 17 after admission to acute rehabilitation. Thus, the last day for the rating period to begin would be day 15. We chose this arbitrary time window to ensure that the rating therapists would be familiar with the patient and to minimize the possibility that they might have known a given patient across widely differing functional levels, which might contaminate ratings. As the project progressed, the upper end of the window was relaxed somewhat to include severely impaired participants, some of whom became more responsive (and thus met the RLA score > III criterion) later than 15 days into their rehabilitation stay. In a few other cases, participants were rated beyond the 17-day cutoff because of experimenter error in calculating the interval, or because of administrative or logistical problems. Of the 315 participants for whom data were available with which to calculate the admission-to-rating interval, 279 (89%) were rated within the 3- to 17-day window. Four participants were inadvertently rated on days 1 and 2, 23 were rated between days 18 and 21, and the remaining 7 were rated beyond that time (days 22–71). We decided to retain all early and late ratings in the analysis because of the small number of outliers. On enrollment of eligible patients into the TBIMS (or admission to acute rehabilitation, for the participants who were not in the TBIMS), a research coordinator attempted to identify a rating interval of 3 consecutive days within the required time window during which the OT and PT staff would be constant, and during which psychotropic medications would also be stable. In some institutions this meant meeting in advance with the physician to discuss medication plans; in other centers, the ratings were carried out regardless but discarded if medication changes were documented during the rating interval. The research coordinator in each institution met with the physical and occupational therapists to explain the project and answer questions about the rating instrument. Beyond repeating this information as necessary, no formal training was conducted on how to use the MARS. Each therapist was asked to complete the rating form independently, based on his/her interactions with and observations of the patient during the course of routinely scheduled therapy sessions during the 3 days. Deidentified rating sheets were faxed to the lead center, where the data were stored in an Access database.a Data Analysis Overview A 3-step process guided our overall data analytic strategy. In the first step, we conducted an EFA on a complete set of 372 MARS ratings from 1 discipline (PT). We then conducted a CFA to assess the fit of the observed factor structure in the other sample of ratings on the same 372 people that were conducted by the occupational therapists. EFA is particularly useful in the early stages of test development because the results may assist in developing hypotheses that can be tested by other methods, such as CFA.25 CFA can be useful in refining and evaluating a measure because it provides a rigorous test of both the conceptual and quantitative assumptions underlying the measure. In step 2, using the results of these 2 analyses, we identified items for tentative factor scales that capture discrete (yet intercorrelated) subcomponents of disordered attention. In the third step, we conducted iterative Rasch analyses on the items identified in these factor scales and examined coverage across rating scale thresholds (transition points between adjacent categories). This was done to determine “gaps” in item difficulty in the subscales and in the overall MARS, thereby assuring the availability of items at all points along the continuum. We reviewed the results of our original full Rasch analysis17 to select items that did not load on a single factor, but were at levels of difficulty appropriate for filling those gaps. Thus, we attained more diverse “coverage” for the overall construct of attention at levels of difficulty not represented by items loading strongly on a given factor. Prior to data analysis, item scores were converted in directionality as necessary so that for all items, a higher score was indicative of better attention. Two thirds of the item scores were converted. The original scoring direction of the items was found not to contribute substantial variance and was not considered further. Factor analyses We used LISREL, version 8.40,b for both the EFA and CFA. The 2-stage least-squares (TSLS) estimation method was used in the EFA. TSLS is often useful in EFA when models are tentative or when there are sources of error in addition to statistical error.25 Thus, TSLS can help investigators determine whether a model is reasonable. In the absence of a single optimal method for determining the number of factors in a measure, the number of factors was specified a priori (up to 7 factors) and models were then fitted and evaluated in the EFA. The primary procedure we used to decide the number of factors to retain was based on statistical fit, using the chi-square statistic for testing the fit of k factors and the Steiger root mean squared error of approximation (RMSEA). The first criterion, chi-square test of k factors greater than .10, guarantees that one stops at k factors if the overall fit is good. The second criterion specifies that the number of factors will not be increased unless the improvement in fit is statistically significant at the 10% level. The third criterion, RMSEA less than .05, guarantees that one does not extract too many factors. Based on theoretical considerations and the findings from the EFA, we specified a 3-factor model for the CFA. Two alternative models were also tested: a 1-factor model and a second-order factor model. The multifactor model would need to show substantially better fit than the 1-factor model to provide support for the multidimensional nature of the MARS. We anticipated, however, that the factors would be moderately intercorrelated. Hence, we also tested a second-order factor model. A higher-order model is present when first-order factors are explained by some higher-order factor structure. In this case, components of attention are subsumed under a second-order factor, “general attention.” When factors are intercorrelated, higher-order factor models are proposed because a second-order general factor can better and more parsimoniously account for the relationships among the factors than can a first-order factor model. That is, factors can be highly correlated with each other because they are related to a general factor, yet these factors may still retain a significant amount of unique variance that is unrelated to the general factor. Higher-order factor models provide a way to evaluate both common and unique variance represented by the factors. Rasch analysis The rating scale model, one of the models within the Rasch measurement family, was implemented using Winsteps, version 3.45.26,c The rating scale model estimates 2 elements that are calibrated on a single common scale: the difficulty level of the items in the set and the ability level of participants in the sample. We used 2 criteria to evaluate the quality of measurement using the MARS; both can be applied to person and item measures. The first, called separation, is the ratio of variance in the measures to their standard error; values greater than 2 are considered acceptable in that they can differentiate at least 3 levels of the trait (in this case, attention).27 Separation is also reported in terms of reliability, with values interpreted similarly to the Cronbach α (values >.80 are considered acceptable; values >.90 are considered good). The second criterion, called fit, represents the extent to which the items fit model expectation; 2 types of fit (infit, outfit) are examined because they are sensitive to different violations of the model. These statistics are reported in terms of chi-square–based mean square values, with values between 0.6 and 1.4 considered acceptable for items using a rating scale.28 To eliminate problems with autocorrelations and to equally represent OT and PT ratings in the Rasch analyses, the ratings were combined and a split-half sample was randomly selected. Thus, a randomly selected 186 participants were represented by their PT ratings and the remaining 186 by their OT ratings. This ensured that the ratings were equally weighted by discipline and that each subject was included only once. We adopted this strategy rather than the one we used in the factor analyses (ie, separate analyses on the 2 samples) because the Rasch analysis was concerned with evaluating item difficulty and not with potential effects of raters or disciplines. This latter concern was addressed in our previous study of the MARS.17
Results  Exploratory Factor Analysis Results of the first-pass EFA on the PT sample suggested that 4 factors should be extracted. The fourth factor, however, had only a few items, and those also had significant cross-loadings on other factors. Hence, we retained 3 factors and used an oblique promax rotation to achieve simple structure. We retained items that tended to load clearly on a single factor as long as the groupings made conceptual or theoretical sense. The 3 factors were interpreted as follows: factor 1 contained items describing restlessness/distractibility (10 items); factor 2 was termed initiation (4 items); and factor 3 seemed to represent sustained/consistent attention (5 items). Confirmatory Factor Analysis We evaluated the 19-item scale derived from the EFA and its associated 3-factor model, using the OT sample via CFA. The data had substantial non-normality, even after we attempted to normalize the items prior to our analysis. Thus, we derived a polychoric correlation matrix and an asymptotic covariance matrix, and used the weighted least squares (asymptotic distribution free) method to estimate the parameters in all 3 CFA models. In our primary 3-factor model, we allowed the error terms to correlate only among items loading on the same factor. Correlated error terms represent the hypothesis that unique variances of the associated indicators overlap. That is, they measure something in common other than the latent constructs that are represented in the model. In this case, it is likely that items loading on the same factor may be susceptible to the same response set. Table 3 presents a panel of fit indexes for all 3 models, based on recommendations by Hu and Bentler29 for assessing model fit derived from stimulation. Both global and component fit indexes indicated that our primary 3-factor model was the best fitting model. For example, the non-normed fit index, comparative fit index, and relative fit index all exceeded .95, RMSEA was less than .06, and standardized root mean squared residual was less than .08. The Bayesian information criterion difference of 290.83 between the 3-factor model and the next best fitting model—the 1-factor model—provided strong support for preferring the 3-factor model over the alternative models.30 The second-order model was particularly problematic in terms of Heywood cases, that is, negative variance estimates. Attempts to provide alternative start values prior to model estimation did not remedy this situation. Hence, we restricted our further focus to the 3-factor model. | | |  | Model | χ2 Test (df) | P | NNFI | CFI | RNI | BIC | AIC | RMSEA | 90% CI RMSEA | SRMR |  |
 | 3-factor | 462.83 (134) | .00 | .98 | .99 | .97 | 794.29 | 574.83 | .05 | .04–.06 | .06 |  |
 | 1-factor | 771.42 (137) | .00 | .95 | .97 | .94 | 1085.12 | 844.42 | .08 | .07–.09 | .08 |  |
 | Second-order | 1078.92 (165) | .00 | .94 | .95 | .93 | 1226.89 | 1128.92 | .09 | .08–.10 | .09 |  | | | |
Table 4 shows the structure coefficients and squared multiple correlations coefficients for the items. The correlation coefficient can be viewed as a measure of reliability whereas the structure coefficients assess aspects of validity in determining which items are strong indicators of the latent construct. The numerals in bold indicate which items loaded on which factor. For example, 4 items loaded on the sustained/consistent attention factor and not on the other 2 factors. It should be pointed out that constraining an item to load on only 1 factor does not constrain the factor structure coefficients to be zero if the factors are correlated. In fact, examining the factor structure coefficients associated with all of the factors can be essential for accurate interpretation. | | |  | Item | RD | Initiation | SC | R2 |  |
 | Is restless or fidgety when unoccupied | .66 | (.32) | (.51) | .44 |  |
 | Tends to speak less than he/she is capable of | (.19) | .39 | (.27) | .15 |  |
 | Performance is best early in the day or after a rest | (.15) | (.13) | .19 | .03 |  |
 | Initiates communication with others | (−.31) | −.63 | (−.43) | .40 |  |
 | Is restless or fidgety during tasks | .71 | (.35) | (.55) | .51 |  |
 | Persists with an activity or response after being told to stop | .73 | (.36) | (.56) | .53 |  |
 | Interrupts tasks to express physical complaints that are unrelated to task | .62 | (.30) | (.48) | .38 |  |
 | Attends to nearby conversations rather than the current task or conversation | .63 | (.31) | (.49) | .40 |  |
 | Tends not to initiate tasks which are within his/her capabilities | (.43) | .87 | (.59) | .75 |  |
 | Speed or accuracy deteriorates over several minutes on a task, but improves after a break | (.34) | (.30) | .44 | .19 |  |
 | Is able to perform or interact for only a few moments before fatiguing or stopping | (.60) | (.53) | .78 | .61 |  |
 | Makes unnecessary comments/conversation that interferes with task performance | .75 | (.37) | (.58) | .57 |  |
 | Performance of comparable activities is inconsistent from one day to the next | (.49) | (.44) | .64 | .41 |  |
 | Perseverates on previous topics of conversation or previous actions | .77 | (.38) | (.59) | .59 |  |
 | Initiates activity (whether appropriate or not) without cueing | (−.44) | −.89 | (−.61) | .80 |  |
 | Sustains activity even when people enter the room | −.74 | (−.36) | (−.57) | .55 |  |
 | Begins to touch or manipulate nearby objects not related to task | .75 | (.37) | (.58) | .56 |  |
 | Performance of comparable activities is relatively consistent from 1 time of day to another | (−.59) | (−.52) | −.77 | .60 |  |
 | Tends to focus on details that are irrelevant for the task or activity | .81 | (.40) | (.62) | .65 |  | | | |
Analysis of the parameter estimates for the 3-factor model revealed no inadmissible estimates (eg, correlations exceeding 1.0) or Heywood cases (ie, negative covariances). All parameter estimates were statistically significant. Note, however, the R2 values for 2 items on the sustained/consistent attention factor (Performance is best early in the day or after a rest, Speed or accuracy deteriorates over several minutes on a task, but improves after a break) and 1 item on the initiation factor (Tends to speak less than he/she is capable of) were low (<.20). This suggested unreliability or a lack of commonality with the other items on their respective factors. The sustained/consistent factor correlated highly with the restlessness/distractibility factor (r=.75) and the initiation factor (r=.66). Initiation and restlessness/distractibility correlated moderately (r=.46). In addition, examination of the factor structure revealed that most MARS items had substantial shared variance across factors; for example, item one (Performance of comparable activities is relatively consistent from 1 time of day to another), coefficients were −.59, −.52, and −.77. In summary, the combined results of the EFA and CFA suggested evidence for the hypothesized multidimensionality in the MARS. The best fitting model, however, produced 3 factors with relatively high intercorrelations among factors, and shared item variance across factors. From a psychometric standpoint, the restlessness/distractibility factor was the strongest factor, with a larger number of coherent items than the other 2 factors. Rasch Analysis As previously noted, one purpose of our study was to reduce redundancy in item content and minimize rater burden, but to still produce a reliable estimate of attention and adequately represent the identified attention subdomains. An item set that could achieve these goals would need to cover a wide range of difficulty (ie, without floor or ceiling effects) with items spaced throughout the distribution, avoiding gaps. A gap is defined as a difference in adjacent item difficulty estimates that is greater than their joint standard errors of measurement. The Rasch analyses were begun with the items that measured each of the 3 identified attention factors. This set of 19 items exhibited good separation and reliability (person, 2.33/.84; item, 4.97/.96; using a criterion of >2.00) and good overall fit, although 1 item (Speaks as much as is capable of) significantly misfit (infit, 1.95; outfit, 2.02; using a criterion of >1.40). We then undertook to delete factor items that were redundant with regard to difficulty level, and to add items from the original MARS set that filled gaps in the overall coverage of the 19-item scale. Items were selected from each factor that covered an adequate range of content and difficulty, and were recombined into a single scale for reanalysis. To address shortcomings in the overall distribution of item difficulty, items that did not load on 1 of the 3 factors but fit the overall attention domain were identified and added to the item set. These additional items were selected both for their difficulty level, their loading patterns on more than 1 identified factor, and their face validity for including item content that was not already represented by factor items. For example, some “higher-level” items, such as detecting one’s own errors, had shown multifactor loadings and thus were not considered members of any of the 3 factors, yet were considered important aspects of attention to be included. Altogether in these calibrations, 5 items were deleted from the original 3 factors (3 redundant items from factor 1, 1 misfitting and redundant item from factor 2, 1 redundant item from factor 3); 4 nonfactor items were added to increase coverage at the extremes; 6 nonfactor items were added to fill gaps in the difficulty distribution across the rating scale; and 2 items that misfit after these additions and deletions were deleted. When these changes were made, a set of 22 items was identified that contained no item misfit or distributional gaps. Eleven of these represented the 3 factors, with the other 11 items drawn from the original MARS. The items and their associated factors are shown in table 5. These 22 items were able to differentiate people into at least 3 distinct strata of attention (person separation and reliability, 2.60/.87) and covered a wide range of content and item difficulty (item separation and reliability, 8.04/.98). Because they contained so few items, the factors were not expected to reliably differentiate people along the attention scale; however, despite their brevity, these factors covered an acceptable range of difficulty. Person and item separation and reliability for the 3 factors was 1.31 and 0.63 and 3.42 and 0.92 for restlessness/distractibility; 1.23 and 0.60 and 3.42 and 0.92 for initiation; and 0.89 and 0.44 and 7.06 and 0.98 for sustained/consistent attention. (More detailed information about the results of all data analyses, including full results of the EFA, Rasch variable maps, and items not retained in the scale, are available from the first author on request.) | | |  | Item | Factor |  |
 | Is restless or fidgety when unoccupied | RD |  |
 | Sustains conversation without interjecting irrelevant or off-topic comments | NA |  |
 | Persists at a task or conversation for several minutes without stopping or “drifting off” | NA |  |
 | Stops performing a task when given something else to do or to think about | NA |  |
 | Misses materials needed for tasks even though they are within sight and reach | NA |  |
 | Performance is best early in the day or after a rest | SC |  |
 | Initiates communication with others | IN |  |
 | Fails to return to a task after an interruption unless prompted to do so | NA |  |
 | Looks toward people approaching | NA |  |
 | Persists with an activity or response after being told to stop | RD |  |
 | Has no difficulty stopping one task or step in order to begin the next one | NA |  |
 | Attends to nearby conversations rather than the current task or conversation | RD |  |
 | Tends not to initiate tasks which are within his/her capabilities | IN |  |
 | Speed or accuracy deteriorates over several minutes on a task, but improves after a break | SC |  |
 | Performance of comparable activities is inconsistent from 1 day to the next | SC |  |
 | Fails to notice situations affecting current performance, eg, wheelchair hitting against table | NA |  |
 | Perseverates on previous topics of conversation or previous actions | RD |  |
 | Detects errors in his/ her own performance | NA |  |
 | Initiates activity (whether appropriate or not) without cueing | IN |  |
 | Reacts to objects being directed toward him/her | NA |  |
 | Performs better on tasks when directions are given slowly | NA |  |
 | Begins to touch or manipulate nearby objects not related to task | RD |  | | | |
The 22-Item Version of the MARS In summary, results of data analysis yielded a new MARS that is less than half the length of the research version and provides: full coverage of the attention dimension represented by the original scale and identified by Rasch analysis; 3 subscores for the correlated factors identified by the EFA and CFA; and a total score that incorporates all 3 factors, plus additional items that round out the difficulty level and multidimensionality of the construct measurement.
Discussion  Our objective in this study was to examine the factor structure of attention disorders in acute TBI using the MARS, a recently developed observational rating scale of attention-related behaviors. One underlying purpose was to investigate whether attention disorders in this population are unidimensional, differing mainly in severity of the disorder, or whether separate subcomponents would emerge, as in previous studies of attention in several populations.21, 22, 31 Our main hypothesis, that there would be discriminable but correlated factors of attention dysfunction, was supported. The best-fitting solution revealed 3 factors that we labeled restlessness/distractibility, initiation, and sustained/consistent attention. Another purpose of the study was to determine whether clarifying the factor structure would help to develop a clinically useful scale for objective measurement of different types of attention dysfunction. The MARS that has been revised as the result of these analyses now contains 22 items, 11 of which may be used to produce factor scores. Thus, the new scale should be considerably easier to complete and should also provide a way to assess progress on specific aspects of attention that may be differentially responsive to treatments. Comparison of the factors in this study with attention factors from previous research is of interest. In making such a comparison, one must remember the different sources of data entering into various factor-analytic studies of attention. Ours is a study of observational ratings. Thus, one cannot anticipate that cognitive processes that are separable through information processing tasks or functional imaging studies will necessarily produce observationally separable behavioral groupings. With this in mind, the factors in the Mirsky21 model—focus/execute, sustain, shift, and encode— do not conform to our results except for the sustain factor. The focus/execute factor, however, may be somewhat akin to our restlessness/distractibility factor. With respect to other previous models, the results reported here did not show evidence of a “complex sustained effort” factor as reported by Pogge et al.22 Our analyses indicated that many items that were apparently related to higher-level attention functions did not load on any 1 factor. Nor did we find, as suggested by Spikman et al,31 that attention is a 2-factor process consisting of elements of automatic and controlled processing. Rather, our results suggest that attention disorders in the acute stages of TBI reflect 3 distinct, yet interrelated processes: the ability to initiate without cueing; the ability to inhibit perseverative, restless or irrelevant responses; and the ability to sustain attention and persist on tasks. These factors make intuitive sense from a clinical standpoint because some patients in the early stages of recovery from TBI do have primary difficulty initiating actions. Others initiate almost too well, but have primary difficulty inhibiting extraneous responses. Still others may initiate tasks adequately but then “drift off,” or demonstrate marked inconsistency as the cardinal feature of their attention disorder. It must be remembered, however, that the factors in this study were predicted to intercorrelate, and that in a disorder with diffuse pathophysiology such as TBI, unitary attention deficits might be rare. Nonetheless, if patients with TBI display predominant deficits in one or another MARS factor, this information could be helpful in conceptualizing the treatments that might be most helpful, educating caregivers about the nature of the problem, and tracking improvement in response to targeted treatments. Given that different neural networks subserving attention are controlled by different neurotransmitter systems, it is possible that scores on particular attention factors respond differentially to different pharmacologic treatments. Thus, in the future, the MARS may be used to prioritize the attention areas that are most problematic, and to select the most appropriate treatment. In terms of the meaning of these results for understanding the overall structure of attention, it must be remembered that a factor structure derived from an impaired population does not necessarily reflect on the structure of the same cognitive construct in an intact sample. For example, Delis et al32 demonstrated that tasks of immediate and long-term memory, which have been shown to dissociate reliably in patients with severe memory disorders, loaded on the same factor in samples with normal memory function. Thus, our results cannot be used to infer factors of initiation, inhibition, and persistence in normal attention. In fact, before even assuming that these findings apply to disordered attention in TBI as a whole, they would need to be replicated in other samples of patients at different levels of severity and acuity, and including different measures of the 3 putative factors. It may at first appear contradictory that Rasch analysis would support a single dimension of attention while factor analysis would identify 3 separable constructs derived from an overlapping data set. This finding, however, is consistent with our overall hypothesis of a loose dimension of attention function, formed of more tightly linked clusters of subfunctions. Indeed, the fact that the 3 factors were intercorrelated is consistent with this idea, as are current understandings of the interactive neural networks controlling attention. Several limitations of the current study must be kept in mind when its results are being interpreted. We have already alluded to some limitations inherent in an observational rating scale. The difficulty range of the scale and the factor groupings within it are necessarily limited to those attentional issues that an outsider can observe. Thus, components of attention that can only be teased apart through more fine-grained physiologic methods may not be apparent on the MARS. Similarly, some of the subtle attention deficits sometimes reported by those with mild TBI (eg, a sense that paying attention is effortful or fatiguing) are unlikely to be seen by outside observers. These kinds of deficits would likely require either a validated self-report method, or assessment through particularly demanding neuropsychologic measures. Despite these limitations, we believe that the MARS is likely to fill an important assessment niche because it may be used with patients who have difficulty cooperating either with neuropsychologic testing or self-report. In such patients, clinicians already reach conclusions and select treatments based on behavioral observation. Attempts to make this observation more systematic and to provide quantitative information on more severe attention deficits are likely to be valuable, not only for monitoring improvement and treatment response, but also for detecting neurologic deterioration. Much additional work remains to complete the validation of the MARS. We have not yet administered the scale in its 22-item version because we want to ensure that its psychometric properties are stable after eliminating the redundant items. In addition, although the item content and factor structure have strong face validity, we have not yet validated the MARS in other ways. In our current research, we are attempting to demonstrate that MARS scores change during recovery, that the scores correlate with existing attention measures, and that the scores are responsive to treatments believed to have an impact on attention. All of these steps are necessary to fully characterize the appropriate uses of the scale. We are also currently exploring the agreement among rehabilitation disciplines in addition to OT and PT in order to develop appropriate recommendations concerning clinical use of the scale. In addition to exploring effects of discipline (and treatment setting) on ratings, it would be valuable to determine the minimum rating interval needed for a stable rating, and whether training or instructions of various kinds improve reliability. Finally, in addition to examining convergent and divergent validity with existing measures of attention and other cognitive constructs, further research on the MARS might explore its use in outpatient populations at different times post injury, particularly for subjects with moderate to severe attention impairments. Studies of the use of the MARS in populations other than TBI, such as anoxic brain injury, might also be warranted. We hope that eventually the MARS will prove useful for both clinical and research purposes. From a clinical perspective, we foresee being able to use the MARS to quantify attention problems that are of clinical concern, to track improvements in them, and to judge responses to treatment. Notably, the MARS may be useful at an earlier point in recovery and with more severe impairments relative to other available tools. The use of a quantitative multidisciplinary scale may allow clinical discussions of attention problems to be more precise, objective, and comprehensive. Moreover, the scores on individual MARS factors may help to highlight areas of disordered attention that are of the highest priority to address. If different treatments ultimately prove to be effective for different types of attention deficits, these factor scores may also help to select treatments. The severity and nature of attention deficits as assessed by the MARS may also provide guidance regarding what strategies should be used for teaching other skills that are dependent on adequate levels of attention. With respect to research uses, the MARS, combined with neuropsychologic and laboratory measures of attention, may shed light on the interrelationships among observable behaviors related to attention and underlying cognitive processes. The MARS may serve as an eligibility-screening tool for studies related to attention deficits, and as an outcome measure for attention treatment trials targeting relatively severe impairments. For all of these applications, the development, refinement and validation of quantitative observational methods for assessing behaviors related to cognition may provide a useful supplement to psychometric approaches to enhance both neuropsychologic practice and research.
Conclusions  Disordered attention in the acute stage of TBI does not appear to be a unidimensional phenomenon, or one differing only as to the severity of the problem. Rather, there are differences in distractibility, initiation deficits, and difficulties with sustained/consistent attention that are measurable using therapist observations based on daily interactions in the inpatient setting. A 22-item version of the MARS produces factor scores for these 3 dimensions of disordered attention and an overall score that includes additional types of attention deficits. Additional research is underway to provide concurrent validation for this scale, which shows promise as a clinical and research tool for studying attention deficits after TBI and evaluating their response to treatments.
Suppliers
Acknowledgments  We thank the TBIMS project directors who facilitated the collection of data at their institutions, and the many clinicians whose ratings made this project possible. Caron Morita and Gemma Baldon made valuable contributions to data collection and management, and Brigid Waldron assisted with manuscript preparation. References  1.
1
Jacobs H
.
The Los Angeles Head Injury Survey
(procedures and initial findings)
.
Arch Phys Med Rehabil
. 1988;69:425–431
.
MEDLINE 2.
2
McKinlay W
, Brooks D
, Bond M
, Martinage D
, Marshall M
.
The short term outcome of severe blunt head injury as reported by relatives of the injured persons
.
J Neurol Neurosurg Psychiatry
. 1981;44:527–533
.
MEDLINE |
CrossRef
3.
3
Ponsford J
, Kinsella G
.
Attention deficits after closed-head injury
.
J Clin Exp Neuropsychol
. 1992;14:822–838
.
MEDLINE |
CrossRef
4.
4
Whyte J
, Polansky M
, Fleming M
, Coslett H
, Cavallucci C
.
Sustained arousal and attention after traumatic brain injury
.
Neuropsychologia
. 1995;33:797–813
.
MEDLINE |
CrossRef
5.
5
Whyte J
, Fleming M
, Polansky M
, Cavallucci C
.
The effects of visual distraction following traumatic brain injury
.
J Int Neuropsychol Soc
. 1998;4:126–136
.
6.
6
McDowell S
, Whyte J
, D’Esposito M
.
Working memory impairments in traumatic brain injury
(evidence from a dual-task paradigm)
.
Neuropsychologia
. 1997;35:1341–1353
.
MEDLINE |
CrossRef
7.
7
Raz A
.
Anatomy of attentional networks
.
Anat Rec B New Anat
. 2004;281:21–36
.
MEDLINE 8.
8
Whyte J
, Hart T
, Vaccaro M
, et al.
The effects of methylphenidate on attention deficits after traumatic brain injury
(a multi-dimensional randomized controlled trial)
.
Am J Phys Med Rehabil
. 2004;83:401–420
.
MEDLINE |
CrossRef
9.
9
Riccio C
, French C
.
The status of empirical support for treatment of attention deficits
.
Clin Neuropsychol
. 2004;18:528–558
.
10.
10
Whyte J
Fifth International Association for the Study of Traumatic Brain Injury
.
Attentional function after traumatic brain injury
(what’s impaired, what’s preserved and why?)
.
In:
Ponsford J
, Snow P
, Anderson V
editor.
International perspectives in traumatic brain injury
. Melbourne: Aust Acad Pr; 1998;p. 154–159
.
11.
11
Kennedy R
, Nakase-Thompson R
, Sherer M
, Nick T
.
Use of the cognitive test for delirium in patients with traumatic brain injury
.
Psychosomatics
. 2003;44(4):1–7
.
MEDLINE |
CrossRef
12.
12
Boake C
, Millis S
, High W
, et al.
Using early neuropsychological testing to predict long-term productivity outcome from traumatic brain injury
.
Arch Phys Med Rehabil
. 2001;82:761–768
.
Abstract | Full Text |
Full-Text PDF (58 KB)
|
CrossRef
13.
13
Whyte J
, Schuster K
, Polansky M
, Adams J
, Coslet HB
.
Frequency and duration of inattentive behavior after traumatic brain injury
(effects of distraction, task and practice)
.
J Int Neuropsychol Soc
. 2000;6:1–11
.
MEDLINE 14.
14
Hart T
, Giovanenetti T
, Montgomery M
, Schwartz M
.
Awareness of errors in naturalistic action after traumatic brain injury
.
J Head Trauma Rehabil
. 1998;13(5):16–28
.
MEDLINE |
CrossRef
15.
15
Hinshaw S
, Nigg J
.
Behavior rating scales in the assessment of disruptive behavior problems in childhood
.
In:
Shaffer D
, Lucas CP
, Richters JE
editor.
Diagnostic assessment in child and adolescent psychopathology
. New York: Guilford Pr; 1999;p. 91–126
.
16.
16
Ponsford J
, Kinsella G
.
The use of a rating scale of attentional behavior
.
Neuropsychol Rehabil
. 1991;1:241–257
.
CrossRef
17.
17
Whyte J
, Hart T
, Bode R
, Malec JF
.
The Moss Attention Rating Scale (MARS) for traumatic brain injury
(initial psychometric assessment)
.
Arch Phys Med Rehabil
. 2003;84:268–276
.
Abstract |
Full-Text PDF (135 KB)
|
CrossRef
18.
18
van Zomeren A
, Brouwer W
.
Clinical neuropsychology of attention
. New York: Oxford Univ Pr; 1994;
.
19.
19
Spikman J
, Timmerman M
, van Zomeren A
, Deelman B
.
Recovery versus retest effects in attention after closed head injury
.
J Clin Exp Neuropsychol
. 1999;21:585–605
.
MEDLINE |
CrossRef
20.
20
Mirsky A
, Anthony B
, Duncan C
, Ahearn M
, Kellam S
.
Analysis of the elements of attention
(a neuropsychological approach)
.
Neuropsychol Rev
. 1991;2:109–145
.
MEDLINE |
CrossRef
21.
21
Mirsky A
.
Disorders of attention
(a neuropsychological perspective)
.
In:
Lyon GR
, Krasnego NA
editor.
Attention, memory and executive function
. Baltimore: PH Brookes; 1996;p. 71–95
.
22.
22
Pogge D
, Stokes J
, Harvey P
.
Empirical evaluation of the factorial structure of attention in adolescent psychiatric patients
.
J Clin Exp Neuropsychol
. 1994;16:344–353
.
MEDLINE |
CrossRef
23.
23
Bushnik T
.
Introduction
(the traumatic brain injury model systems of care)
.
Arch Phys Med Rehabil
. 2003;84:151–160
.
Abstract |
Full-Text PDF (44 KB)
|
CrossRef
24.
24
Teasdale G
, Jennett B
.
Assessment of coma and impaired consciousness
.
Lancet
. 1974;2:81–84
.
CrossRef
25.
25
Joreskog K
, Sorbom D
, du Toit S
, du Toit M
.
LISREL 8
(new statistical features)
. Lincolnwood: Scientific Software International; 2000;
.
26.
26
Wright B
, Masters G
.
Rating scale analysis
(Rasch measurement)
. Chicago: Mesa Pr; 1982;
.
27.
27
Smith EV
.
Evidence for the reliability of measures and the validity of measure interpretation
(a Rasch measurement perspective)
.
J Appl Meas
. 2001;2:281–311
.
MEDLINE 28.
28
Bond T
, Fox C
.
Applying the Rasch model
(fundamental measurement in the human sciences)
. Mahwah: Lawrence Erlbaum; 2001;
.
29.
29
Hu L
, Bentler PM
.
Cutoff criteria for fit indexes in covariance structure analysis
.
Structural Equation Modeling
. 1999;6(1):1–55
.
30.
30
Raferty AE
.
Bayesian model selection in structural equation models
.
In:
Bollen KA
, Long JS
editor.
Testing structural equation models
. Newbury Park: Sage; 1993;p. 163–180
.
31.
31
Spikman J
, Kiers H
, Deelman B
, van Zomeran A
.
Construct validity of concepts of attention in healthy controls and patients with CHI
.
Brain Cogn
. 2001;47:446–460
.
MEDLINE |
CrossRef
32.
32
Delis D
, Jacobson M
, Bondi M
, Hamilton J
, Salmon D
.
The myth of testing construct validity using factor analysis or correlations with normal or mixed clinical populations
(lessons from memory assessment)
.
J Int Neuropsychol Soc
. 2003;9:947–953
.
MEDLINE a Moss Rehabilitation Research Institute and Department of Rehabilitation Medicine, Jefferson Medical College, Thomas Jefferson University, Philadelphia, PA b Wayne State University School of Medicine, Detroit, MI c Rehabilitation Institute of Chicago and Feinberg School of Medicine, Northwestern University, Chicago, IL d Mayo Clinic, Rochester, MN e Mississippi Methodist Rehabilitation Center, Jackson, MS f Charlotte Institute of Rehabilitation, Charlotte, NC Reprint requests to Tessa Hart, PhD, Moss Rehabilitation Research Institute, 1200 W Tabor Rd, Philadelphia, PA 19141
Supported in part by the National Institute on Disability and Rehabilitation Research (grant no. H133A70033). No commercial party having a direct financial interest in the results of the research supporting this article has or will confer a benefit upon the author(s) or upon any organization with which the author(s) is/are associated. PII: S0003-9993(06)00099-2 doi:10.1016/j.apmr.2006.01.016 © 2006 American Congress of Rehabilitation Medicine and the American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved. | |
|