|
|
||||||||
Research Reports |
AB Deathe, BSc, MD, FRCP(C), is Associate Professor, Faculty of Medicine, Department of Physical Medicine and Rehabilitation, and Medical Director, SouthWestern Ontario Amputee Program, St Joseph Health Centre, University of Western Ontario, Parkwood Campus, London, Ontario, Canada
WC Miller, PhD, MSc, BScOT, is Assistant Professor, School of Rehabilitation Sciences, University of British Columbia, and Scientist, GF Strong Rehabilitation Research Lab and Centre for Clinical Epidemiology and Evaluation, Vancouver Coastal Health, Research Pavilion, 828 W 10th Ave, Vancouver, British Columbia, Canada V5Z 1L8 (bcmiller{at}telus.net)
Address all correspondence to Dr Miller
Submitted March 27, 2004;
Accepted December 20, 2004
| Abstract |
|---|
Key Words: Ambulation Amputation Outcome measure Reliability Validity
| Introduction |
|---|
|
|
|---|
Performance walk tests used in rehabilitation with a prosthetic device include measures of fixed walking time, such as the 2-Minute Walk Test,4 and measures of fixed walking distance, such as the Timed "Up & Go" Test (TUG),5 the 10-Meter Walk Test,6 and gradations of a 100-m walk.7 Studies8,9 have shown that objective performance-oriented tests have excellent measurement properties and are able to predict future function.
In our Regional Amputee Program, we use the TUG and the 2-Minute Walk Test to assess inpatient and outpatient prosthetic training. We have observed a ceiling effect with respect to the short version of the TUG (6-m total distance), particularly for elderly people who are more fit and for younger people with amputations. The longer 2-Minute Walk Test, however, is difficult to administer in the outpatient clinic setting, given that a 20-m hallway that is relatively free of other patients and staff is needed and given that clinics are required to conduct the test without distraction. Therefore, our need was to find or develop a test that could be easily and quickly administered concurrent with each patient visit that assisted with determining ability to walk with prosthetic devices. Our preference was to retain the transfer skill set of the TUG. Observation of a patient's gait during clinics showed that we usually asked the patient to get up and walk out of the room, turn and go down the hall, then return to the room and sit down. This walking path, representing an "L" configuration, required turns to both the right and the left. Standardizing the distance (3 x 7 m) led to the development of a potentially more demanding, yet practical, modification of the TUG that we have titled the L Test of Functional Mobility (L Test).
We believe that the L Test may be a useful indicator of mobility that will distinguish change in status for not only older people with amputations who are more disabled and have frail health, but also for fit older and younger individuals who are less disabled. The objectives of this study were to assess the reliability and the validity of data obtained with the L Test.
| Method |
|---|
|
|
|---|
Protocol
Subjects attending their regularly scheduled appointment who met the study criteria were identified, and the study process was explained. After completing their visit with the outpatient rehabilitation program team, individuals who consented to participate were taken to a quiet room beside the clinic. After demographic data were collected, all subjects completed a set of walk tests (time 1) and then completed a set of self-report questionnaires. Finally, a different rater conducted a second set of walk tests (time 2) in order to determine interrater reliability.
All of the walk tests were conducted according to the assigned standardized protocol. Walking aids were permitted, and the type of aid was recorded if used. A minimum of 2 minutes of rest between walk tests was provided. The order of the walk tests was not randomized in order to allow the raters to assess whether subjects who were more frail would have difficulty completing the longer and potentially more taxing 2-Minute Walk Test. All subjects completed testing in the same progression, starting with the walk tests (the TUG, followed by the 10-Meter Walk Test, the L Test, and the 2-Minute Walk Test), followed by completion of the self-report questionnaires (which took approximately 15 minutes to complete). Three trials of the L Test were completed during the time 1 and time 2 walk tests to determine whether a learning effect was observed when performing the test. Subjects who agreed to retesting returned 2 weeks later (time 3), when 3 consecutive L Test trials and a single trial of the other walk tests were conducted.
Measurement
A number of different measures were used to assess validity. Investigation of a variety of relationships from different perspectives enhances the support for the validity.10 The measures included clinically important variables (amputation cause and level), indicators of adaptation to the prosthesis and amputation (balance confidence), and variables that assess higher levels of function (social activity participation), as well as measures of mobility capability and performance (walk) tests.
Walk Tests
The L Test is a modified version of the TUG. The L Test incorporates 2 transfers and 4 turns, of which at least 1 would be to the opposite side. The total distance covered is a 20-m walk. Standardized instructions were developed and given to the subjects to ensure successful completion of the test. The L Test was demonstrated for each subject, and all subjects performed a practice trial. Additionally, we standardized the amount and type of encouragement based on findings by Guyatt et al11 suggesting that these factors can enhance performance. The time (in seconds, to the nearest 10th of a second) that it took for the subject to stand from an armless chair, walk 10 m (in the shape of an L) at the subject's usual walking speed, turn 180 degrees, and return 10 m (in the shape of an L) to a seated position was recorded.
The 2-Minute Walk Test is a measure of the distance (in meters) that an individual is able to walk at his or her "usual" pace. Starting from a standing position, subjects walked around pylons placed 25 m apart for 2 minutes. The distance covered was measured to the nearest 10th of a meter using a walk wheel. This test, which is often used clinically to determine the progress of walking endurance using a prosthetic device, is a shortened version of the original 12-Minute Walk Test. Six- and 2-minute versions of the 12-Minute Walk Test were developed and tested to provide clinicians with a test that took less time to complete.4 Data for the 2-Minute Walk Test have been found to be highly correlated with data for the 12-minute (r=.86) and 6-minute (r=.89) versions.4
The TUG assesses many of the components of basic mobility, including balance, transfers, walking, and turning while walking.5,12 The time (in seconds) that it takes for an individual to stand from a sitting position, walk a 3-m distance, turn, walk back to the chair, and sit down is recorded. The TUG was found to have excellent intrarater (r=.93) and interrater (r=.96) reliability, and evidence of convergent and divergent validity among a sample of people with lower-limb amputations has been reported.5
The 10-Meter Walk Test has principally been used to test individuals with neurologic impairment13,14 and lower-extremity amputation.6,15 The time taken to walk a distance of 10 m at the usual pace from a standing start was recorded. For this study, we embedded the 10-Meter Walk Test into the start of the 2-Minute Walk Test as other researchers14 also have done. Support for concurrent validity of data for the 10-Meter Walk Test among a sample of 53 people with either a TT or TF lower-limb amputation has been reported.15 Rossier and Wade14 also have demonstrated intrarater reliability of data for the 10-Meter Walk Test using correlation (r=.93) and Bland-Altman plots.
Self-report Variables
The Activities-specific Balance Confidence (ABC) scale asks subjects to rate their balance confidence on a scale of 0% to 100% on each of 16 mobility-related activities.16 Responses are summed and divided by 16 to provide an overall mean balance confidence score. Internal consistency (Cronbach alpha=.93), 4-week retest reliability (intraclass correlation coefficient [ICC]=.91), and minimal bias between the repeat measurements have been reported for a sample of subjects with lower-limb amputations.17
The 15-item Frenchay Activities Index (FAI) was used to measure the frequency of participation in activities ranging from preparing meals to working over the past 3 to 6 months.1820 The individual item responses of the FAI capture frequency, with responses ranging from 0 ("never or none") to 3 ("daily or weekly"). A summary score is derived by adding the total items, which range from 0 ("no activity") to 45 ("very high participation"). Internal consistency (Cronbach alpha=.84), intrarater reliability (ICC=.78), and support for validity based on correlations with the TUG, 2-Minute Walk Test, ABC scale, and the mobility subscale of the Prosthetic Evaluation Questionnaire (PEQ-MS) have been demonstrated among individuals with lower-limb amputations.21
The mobility capability of the subjects was assessed using the 13-item self-report PEQ-MS.22 The PEQ-MS evaluates the perceived potential for mobility using prosthetic devices over the past 4 weeks. The mobility scale is the combination of the ambulation and transfer subscales from the full PEQ. Although the original questionnaire uses a visual analog scale (VAS), we modified the responses to a numerical scale ranging from 0 ("cannot do activity") to 10 ("no problem"). A summary score was derived by adding the responses and then dividing by the number of items to which the subjects responded. Internal consistency (Cronbach alpha=.95), retest reliability (ICC=.77), and support for validity based on correlations with data for the TUG (r=.5), the 2-Minute Walk Test (r=.5), and the ABC scale (r=.85) have been reported among individuals with lower-limb amputations.23 Furthermore, the PEQ-MS was found to discriminate among individuals based on gait aid use and self-reported walking distance, among other factors.23
A number of clinical variables were included to further assess the validity of the L Test data. These variables included: (1) amputation level (TF or TT), (2) amputation cause (vascular or traumatic), (3) mobility device use (cane, crutches, or walker), (4) automatic walking (automatism), and (5) age. Measurement of mobility device use was ascertained by asking the participants whether they used crutches, a walker, or a cane while ambulating indoors or outdoors, or no device at all. Automatic walking (automatism) was determined by asking the participants whether they had to concentrate on each step while walking. Individuals responded to this question, which was taken from the Prosthetic Profile of the Amputee (PPA),24 by indicating "yes" or "no." Age was partitioned into a binary variable using the median score to facilitate analyses.
Data Analysis
Means, standard deviations, and proportions were derived in order to facilitate description of the sample and scores for L Test and retest results. A repeated-measures analysis of variance (ANOVA) using Bonferroni post hoc analyses was used to determine whether statistically significant differences existed on L Test times over the 3 measurement periods (time 1, 2, and [retest] 3). Data from trial 3 of each measurement period were used in the analysis.
Reliability was calculated using 2-way ANOVAs to derive intrarater ICCs (2,1) and interrater ICCs (2,2).25 The third L Test trials from time 1 and time 3 were used for calculating the intrarater ICC, and the third L Test trials from time 1 and time 2 were used for calculating the interrater reliability. The standard error of measurement (SEM)26 was used to assess how an individual score varied on repeated measurement. The SEM was derived by multiplying the standard deviation by the square root of 1 minus the reliability coefficient derived from the interrater reliability analysis. Finally, the Bland-Altman or Limits of Agreement method27 was used to provide a visual assessment of within-test repeated measurement of the agreement between raters and to identify any bias that might exist.28 Rankin and Stokes28 have advocated this approach as being easier to understand and interpret. The mean of the third trial in each of the time 1 and time 2 L Test times were plotted against the difference between the time 1 and time 2 L Test times for the Bland-Altman method, and a t test was used to assess mean differences. Furthermore, the mean and standard deviation of the mean difference as well as the true value of the mean using 95% confidence intervals (CIs) were calculated to further assess the existence of bias.27 The Bland-Altman procedure was not done for intrarater reliability because the sample size was smaller than 50 subjects.27,28
Validity was assessed by testing several a priori hypotheses related to whether the L Test data correlated with data for the other walk tests and the self-report scales. A positive correlation was anticipated between the L Test and both the TUG and 10-Meter Walk Test because these are all measures of gait speed. A negative correlation was expected between the L Test and the 2-Minute Walk Test because those individuals who are more likely to complete the L Test quickly would be expected to walk farther on the 2-Minute Walk Test. Finally, we hypothesized that the L Test data would correlate negatively with data for the self-report scales based on previous research we have done. That is, individuals who scored better (higher) on the FAI,21 PEQ-MS,23 and ABC17 scales would have scores that correlate negatively with the scores on the L Test.
Pearson product moment correlation coefficients were calculated in all cases. Our clinical experience suggested that the L Test should be able to discriminate between those with TT and TF amputations and those with vascular versus traumatic amputations. Furthermore, we expected that the mean time to complete an L Test would be higher among older individuals, among individuals who used a mobility device, and among those who had to think about stepping.
In order to assess whether the L Test would reduce the number of subjects experiencing a possible ceiling effect, we operationally defined the ceiling as being within the 95% confidence limit of the fastest overall time recorded by any subject on the L Test and the TUG. To calculate the 95% confidence limit, we first derived the SEM (SEM=standard deviation of the test/square root of the sample size) for each test and then multiplied it by 1.96. Next, we used the McNemar test to determine whether there were fewer subjects within the range of the ceiling effect for the L Test than for the TUG because the data are not independent (the same subjects provided information for both tests).26 Times for the L Test from trial 3 of time 1 were used for this determination.
Correlations were considered statistically significant at P<.05, and significance for all other tests was set at P<.01 in order to reduce the potential for type I errors that may result from multiple testing. Analyses were conducted using SPSS for Windows, version 8.*
| Results |
|---|
|
|
|---|
|
Additional results for intrarater reliability using the Bland-Altman plot (Fig. 1) demonstrated a roughly equal distribution above and below the 0 line. Slightly more data points above the line for the interrater plot suggested that times were slightly faster during the first test session (time 1) than during the second test session (time 2). The distribution of the points was not equal along the line; therefore, reliability was not perfect (because of clustering around the left-hand side of the plot). This finding also suggests that a ceiling effect may be evident in the L Test. The mean difference was 0.31 (SD=4.4), with a 95% CI of 1.2 to 0.59. The inclusion of 0 in the CI suggests minimal bias among the measurement sessions. Only 3 of the data points out of a possible total of 93 data points were found to lie outside of the 95% CI (2 standard deviations of the mean difference), further suggesting that the error in reliability readings for the interrater reliability was not statistically significant.27,28 A t test of the mean values demonstrated there was no statistical difference (t=0.678, P=.499).
|
|
|
|
| Discussion |
|---|
|
|
|---|
Other walk tests exist; however, the premise behind the development of the L Test was that it would require a higher level of skill with turns to both the left and the right as well as a sit-to-stand transfer. Face validity suggests that this test reflects the minimal needs for functional mobility in the home. For example, the L Test would indicate whether person would be able to rise from a couch in the living room and ambulate to the washroom. Another advantage of the L Test is that it incorporates a pragmatic design that allows testing in an outpatient hospital clinic setting. For example, upon examination in a typical small office, the patient is commonly asked to rise from the chair or plinth, walk forward out of the office door, turn, walk down the hallway, and then return and sit down. This activity enables the clinician to visually assess the biomechanics and prosthesis-limb interface. The 20-m distance covered by the L Test is twice that of the 10-Meter Walk Test, and 3 times that of the TUG. The distance was increased with the intent that the L Test would be more responsive than the other tests and, therefore, more useful when used with younger individuals who have a lower-limb amputation. Although we cannot comment specifically on the true nature of responsiveness of the L Test, we have shown that it does minimize the ceiling effect observed in the TUG.
The time taken to complete the L Test is generally twice that of the TUG and commonly approaches the 2-minute duration of the 2-Minute Walk Test, especially among the older, frailer population (Fig. 2A). This finding suggests that the L Test may serve as a tool that assesses the transition between room ambulation reflected by the TUG and community ambulation reflected by the 2-Minute Walk Test. Five out of 9 of the recruited subjects who were excluded from this study were unable to complete the L Test, further suggesting that the L Test might require additional physical resources for these individuals.
In the present study, we found the L Test to have excellent interrater and intrarater reliability, as demonstrated by the ICC values. Consequently, this finding suggests that overall there is minimal measurement error resulting when this measure is used in a standardized fashion by different raters or by the same rater over time. We had anticipated such results, given that other authors5,14 have reported similar findings with comparable walk tests among people with amputations. The greatest variation in the scores was seen among subjects who had very slow times on the L Test, and 3 of these "slower" subjects recorded scores that were more than 2 standard deviations different when tested by the different raters. It is unclear why the scores of these subjects were so different between tests. An increase in time was observed when 2 individuals were tested by the second rater, which may have been the result of fatigue associated with multiple walk tests conducted during the session. Additional evidence for this hypothesis can be drawn from the fact that both the standard deviation and the range of times for the L Test increased between time 1 (range=17.495.2 seconds) and time 2 (range=17.1108.7 seconds). Time 3 L Test times (range=20.148.7 seconds) improved considerably, perhaps suggesting a learning effect; however, no statistical difference was observed in L Test times among testing times 1, 2, and 3. Given that fewer than 5% of the subjects recorded such different scores and that 0 were captured within the 95% CI for the mean difference in recorded times28 (as reflected in the Bland-Altman analyses), we are confident that the L Test is robust with respect to reliability in people with lower-extremity amputations. However, because reliability is population-specific, assessment for repeatability in other diagnostic groups is encouraged.29
Additional analysis of reliability was conducted using the limits-of-agreement approach advocated by Bland and Altman.27 The data points across the Bland-Altman zero line (Fig. 1) tended to concentrate at the left end, indicating that a large percentage of the subjects completed the test relatively quickly (ranging between 20 and 40 seconds). This concentration of data points near the left end may be explained by the number of subjects whose amputation was related to traumatic etiology or because the inclusion criterion for the study essentially focused on experienced prosthesis users who were not having problems with respect to pain and likely had reached a plateau or were confident in their ability to use the prosthesis. Ideally, the data points would be evenly distributed along and on either side of the line. The findings suggest that the L Test might not differentiate between various subgroups of the population of people with amputations. Despite this finding, the discriminant validity was observed to be quite good. All of the projected differences were observed according to our a priori hypotheses. It is particularly encouraging that amputation level is discriminated. Tests of functional mobility that do not distinguish among amputation levels are highly suspect regarding their utility, given the clinical observation that individuals with TF amputations face many more challenges than those with TT amputations.
The L Test data were correlated with data for all of the self-report measures in the hypothesized direction. Given the similarities, we expected to find a stronger relationship between the PEQ-MS and the L Test, because a previous study23 demonstrated covariance of up to 25% (r=.5) between the PEQ-MS and the TUG. We anticipated greater overlap between the self-evaluation measure of skill capability (PEQ-MS) and the objective measurement captured by the L Test.
The interrater SEM of the sample (3.0) provides a platform toward thinking about values that may demonstrate a relevant level of responsiveness given a change at the group level.30,31 For instance, we can be 68% sure that a "true" change has occurred if the group value shifts ±2.6, and 95% sure that a statistically relevant change has occurred with a shift in score of greater than ±6.2 seconds. Although this information might provide some preliminary information for future studies using the L Test, additional trials are needed to better investigate other aspects of responsiveness such as minimally important clinical change.30
In the present study, the walk tests were sequenced in orderprogressing from the TUG, to the 10-Meter Walk Test, to the L Test, and finally to the 2-Minute Walk Testwhich enabled us to watch for subjects who might be compromised in terms of their cardiovascular capacity and therefore not be able to proceed to the next level of walk test. Despite this approach, 9 of the 102 recruited subjects were unable to complete the walk tests. Ideally, the order should have been randomized to minimize bias related to order effects. Using a randomly selected sample versus a convenience sample would have led to more confidence in the findings, because the population seen during the data collection period may have been systematically different. Furthermore, based on the L Test scores at time 3, which tended to be faster and had a tighter range of times (eg, the standard deviation was smaller), our intrarater subsample also may have been different than our total sample, despite having very similar demographics related to age, years since the amputation, and amputation level. The generalizability of the study results also is limited to people with unilateral amputations who had had their prosthesis for at least 6 months who were medically stable and experienced users of prosthetic limbs. Replication studies that compensate for these limitations are encouraged.
| Conclusion |
|---|
|
|
|---|
| Footnotes |
|---|
The study protocol was approved by the University of Western Ontario human ethics board.
Financial support for the study was provided by the Parkwood Hospital Foundation. The authors acknowledge Dr Miller's postdoctoral salary support award from the Canadian Institutes of Health Research and the Michael Smith Foundation for Health Research.
* SPSS Inc, 233 S Wacker Dr, Chicago, IL 60606. ![]()
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. S Gilchrist, M. L. Galantino, M. Wampler, V. G Marchese, G S. Morris, and K. K Ness A Framework for Assessment in Oncology Rehabilitation Physical Therapy, March 1, 2009; 89(3): 286 - 306. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-H. Jan, J.-J. Lin, J.-J. Liau, Y.-F. Lin, and D.-H. Lin Investigation of Clinical Effects of High- and Low-Resistance Training for Patients With Knee Osteoarthritis: A Randomized Controlled Trial Physical Therapy, April 1, 2008; 88(4): 427 - 436. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. R Archer, R. C Castillo, E. J MacKenzie, M. J Bosse, and and the Lower Extremity Assessment Project (LEAP) Gait Symmetry and Walking Speed Analysis Following Lower-Extremity Trauma Physical Therapy, December 1, 2006; 86(12): 1630 - 1640. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |