PTJ
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


PHYS THER
Vol. 82, No. 8, August 2002, pp. 762-771

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Rapid Responses are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Flegel, J.
Right arrow Articles by HA Kolobe, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Flegel, J.
Right arrow Articles by HA Kolobe, T.
Related Collections
Right arrow Motor Development
Right arrow Tests and Measurements
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Research Reports

Predictive Validity of the Test of Infant Motor Performance as Measured by the Bruininks-Oseretsky Test of Motor Proficiency at School Age

Judy Flegel and Thubi HA Kolobe

J Flegel, PT, MS, is Physical Therapist, Curative Rehabilitation Services, Milwaukee, Wis. She was a student at the University of Illinois at Chicago at the time this research was completed in partial fulfillment of the requirements for her Master of Science degree in physical therapy.
THA Kolobe, PT, PhD, is Assistant Professor, Department of Physical Therapy, University of Illinois at Chicago, Chicago, Ill

Address all correspondence to Ms Flegel at 4069 N Prospect, Milwaukee, WI 53211 (USA) (Judyfpt{at}aol.com)


Submitted July 20, 2001; Accepted February 6, 2002


    Abstract
 
Background and Purpose. As the survival rate of infants who are born prematurely increases, the need for accuracy in early identification and prediction of developmental outcome in these infants is imperative. This study examined the predictive validity of the Test of Infant Motor Performance (TIMP) and the relationship between perinatal risk and motor performance in children who are of school age. Subjects and Methods. A stratified, random sample of 35 children was selected from subjects who had been examined using the TIMP and the Problem-Oriented Perinatal Risk Assessment System (POPRAS). The subjects were between 32 weeks gestational age and 4 months postterm (mean age=10 days postterm). The stratification was based on age at the time of TIMP administration (AGE) and on POPRAS scores. The Bruininks-Oseretsky Test of Motor Proficiency (BOTMP) was administered to the children at a mean age of 5.75 years. Predictive values were calculated and correlation and regression analyses were conducted to examine the relationship between the TIMP and POPRAS scores and the BOTMP score. Results. Using the receiver operating characteristic curve and a cutoff z score of –1.6, the TIMP's sensitivity, specificity, and positive and negative predictive values were .50, 1.00, 1.00, and .87, respectively. The correlation between TIMP and BOTMP scores, with AGE controlled, was .36, and the correlation between POPRAS and BOTMP scores was –.55. Thirty-seven percent of BOTMP variance was accounted for collectively by POPRAS scores, TIMP scores, and AGE. Discussion and Conclusion. The TIMP and POPRAS may be viable instruments that can be used together to identify infants who are at risk for poor long-term motor performance.

Key Words: Bruininks-Oseretsky Test of Motor Proficiency • Children • Infant • Motor development • Predictive validity


    Introduction
 Top
 Abstract
 Introduction
 Method
 Results
 Discussion
 Conclusion
 References
 
Early identification of children who are at risk for developmental delays is a primary focus of physical therapy for preterm infants in neonatal intensive care units (NICUs). A preliminary study by Girolami and Campbell1 suggests that early identification of these infants with early introduction of preventive interventions may avert functional limitations at a later age. The effectiveness of prevention, however, depends in part on the accuracy with which diagnostic tests can identify infants who are likely to benefit from these interventions. Clinical use of an infant motor development test with a low positive predictive value, for example, may result in overreferral of infants, wasting resources on infants misdiagnosed as having abnormal motor development.

Given the emotional and financial burden placed on families when there is intervention with infants who are at risk for developmental delays, clinicians must consider the information about sensitivity and specificity when selecting a diagnostic test. Sensitivity and specificity pertain to how well a test performs in the presence (or absence) of the condition of interest. A sensitive test is likely to miss few infants with atypical motor development (low false negative).2 Because it is unlikely that a test will be both highly sensitive and highly specific, the decision to use a particular test depends on the type of risk for misclassification that the test user is willing to accept.2 Fletcher and associates2 recommend using a test with high sensitivity when there is an important consequence for missing a disorder (eg, a serious, progressive condition that is treatable). For children with typical motor development, a specific test is less likely to misdiagnose them as having atypical motor development (low false positive). A highly specific test is useful in substantiating a diagnosis that may have been suggested by other information.2

Because the accuracy of diagnosis of infants born prematurely increases with age,3,4 we believe it is important that, in addition to sensitivity and specificity, diagnostic tools that are used during the neonatal period also demonstrate high predictive values. Predictive values are useful in interpreting the test results of an individual infant. A positive predictive value describes the probability that an infant with a positive test result will indeed have the disorder, whereas a negative predictive value describes the probability that an infant with a negative test result will have a normal outcome.2

Many instruments have been developed to predict developmental outcome in infants, but their accuracy has varied.5 Few of these instruments focus on motor performance. In studies that have examined the predictive validity of motor performance assessments, we believe 2 problems may limit the clinical application of the findings with preterm and very young infants. The first problem is the age at which the outcome measure is administered. Evidence exists that some of the neurological and medical abnormalities observed in premature infants during the first 2 years of life may be transient (transient dystonia).68 Given this instability of neuromotor behavior during the first 2 years of life, it would appear that obtaining a measurement after 2 years of age would circumvent this problem.

The second issue is the infant's age at the time the predictive test is administered. The tendency in numerous predictive studies has been to test infants at 4 months of age or older, even though the test under investigation is designed to be used for younger ages (0–4 months).9,10 Consequently, there is a dearth of information regarding motor tests that could be used in NICUs or nurseries to identify infants who are at risk for developmental delays. If the objective of prediction is early identification of infants at risk, such as when the infant is still in the NICU, it is essential that very young infants are included in studies of predictive validity.

The primary purpose of our study was to examine the predictive validity of the Test of Infant Motor Performance (TIMP). We examined whether TIMP scores collected between 32 weeks gestational age and 16 weeks postterm age accurately classified the children as typically developing or developmentally delayed based on fine and gross motor performance at early school age as measured by the Bruininks-Oseretsky Test of Motor Proficiency (BOTMP).11

The TIMP, developed by Campbell and associates,12 is a comprehensive motor test designed to assess functional motor performance of infants between 32 weeks postconceptional age and 16 weeks postterm age. Initial reports on the TIMP suggest that it yields reliable measurements that are valid for discriminating optimal motor performance from poor motor performance in infants born preterm and very young infants.1214 Test of Infant Motor Performance scores have been reported to be sensitive to changes in infants' motor performance due to maturation and medical complications.12 Infants with greater numbers of medical complications, as measured by the newborn form of the Problem-Oriented Perinatal Risk Assessment System (POPRAS),15 had lower TIMP scores than infants of the same age with fewer medical complications.

The TIMP consists of 59 items divided into 2 sections: Elicited and Observed. The Elicited section items assess the infants' motor responses to placement in various positions and to visual or auditory stimulation. The Observed section items are used to rate spontaneous movement exhibited by infants. Items for the TIMP were taken from neurologic and developmental tests, such as tests developed by Dubowitz et al16 and motor assessment procedures developed by Cioni and Prechtl.17 Additional items and scoring descriptors were developed by Campbell et al.12 The Rasch psychometric model guided the development of the items.18 The items on the TIMP have been shown to have ecological validity (ie, the Elicited section items are similar to demands placed on infants in everyday caregiving situations).19 Concurrent validity with the Alberta Infant Motor Scale was demonstrated for infants at 3 months of age.14 In our opinion, satisfactory interrater reliability of TIMP measurements among experienced examiners and satisfactory test-retest reliability (r=.89) also have been reported.12,13 Although the TIMP demonstrates attributes of a valid developmental test for very young infants, there are no reports concerning whether TIMP scores can predict motor outcome at a later age.

Because motor performance on the TIMP has been found to be correlated with scores on the POPRAS,12 the infants' medical complications cannot be overlooked when examining the predictive validity of the TIMP. Therefore, a secondary purpose of our study was to examine the relationship between perinatal risk, as measured by the POPRAS, and BOTMP scores at early school age. The POPRAS is a perinatal risk scale that was developed by Hobel and associates20 to assess prenatal and perinatal medical complications. All information needed to score the POPRAS is obtained from the medical record. Higher POPRAS scores are associated with greater medical complications, particularly during the neonatal period.20,21


    Method
 Top
 Abstract
 Introduction
 Method
 Results
 Discussion
 Conclusion
 References
 
Subjects

The subjects were 35 children at early school age who, as infants, had been tested using the TIMP during the test's development in a study by Campbell et al.12 The original sample (N=137) consisted of preterm and full-term infants with varying degrees of medical complications who were recruited from 3 nurseries and communities in the Chicago metropolitan area. Infants with congenital malformations were excluded. At the initial TIMP testing, the testing, the infants' ages ranged from 32 weeks gestational age to 3.5 months postterm age. The infants were stratified into 7 age groups and 3 risk categories based on age at the time of TIMP administration (AGE) and the POPRAS scores, respectively (Tab. 1). The POPRAS was completed by a trained nurse/recruiter. All infants were medically stable and were tested by physical therapists and occupational therapists who had been trained to administer and score the TIMP. The young infants were tested 1 hour before feeding time either in their incubators or at home.


View this table:
[in this window]
[in a new window]
Table 1. Sample Stratified Into 7 Age Groups and 3 Risk Categoriesa

 
The subjects in our study were selected through a stratified sampling approach that was similar to that used in the original sample, which consisted of 7 age groups (based on AGE) that were then subdivided into 3 categories each based on medical risk as measured by the POPRAS (Tab. 1). Because of the difficulty in obtaining subjects in follow-up study, we conducted a power analysis to determine the sample size for indentifying associations at the .05 level ({alpha}=.05, power for the F test=0.80, and R2=.27). Based on the discussion of guidelines for the use of correlation coefficients for health science studies in Portney and Watkins,22 we selected an R2 value of .27 because it reflected a moderate degree of association for our power analysis. The minimum number of subjects necessary was determined to be 34. Sixty-five of the 137 subjects from the Campbell et al12 study were located. Two children from each age group and risk category were then randomly selected for testing, except for 7 categories (either risk or age) in which only 1 child had been located. This stratification process yielded 12 children (34.3%) who were classified as being at low risk for developmental delays, 10 children (28.6%) who were classified as being at medium risk for developmental delays, and 13 children (37.1%) who were classified as being at high risk for developmental delays. There were 4 to 6 children in each of the 7 age categories.

The parent or guardian of each child gave permission for the child's participation in the study. The subjects had an average gestational age of 32 weeks (SD=5, range=24–41) and had received the TIMP at an average postnatal age of 59 days (SD=49, range=2–164). The mean POPRAS score for the total sample, as infants, was 73.7 (SD=44.3, range=2–166). At the time of testing on the BOTMP, the children ranged in age from 4 years 9 months to 7 years 3 months, with a mean age of 5 years 8 months. There were 19 male children and 16 female children. Sixteen (45.7%) of the children were white, 10 (28.6%) were Latino, and 9 (25.7%) were African American. Three children were diagnosed with cerebral palsy, and 1 child was diagnosed with Down syndrome.

Instrument

The complete battery of the BOTMP was used as a criterion measure of motor outcome. This test is used to assess the motor functioning of children from 4.5 to 14.5 years of age.11 The BOTMP consists of 8 subtests (running speed and agility, balance, bilateral coordination, strength, upper-limb coordination, response speed, visual-motor control, and upper-limb speed and dexterity), with a total of 46 separate items that provide a comprehensive index of motor proficiency as well as separate measures of both gross and fine motor skills. The BOTMP provides age-normed, composite scores, based on a T-score with a mean of 50 (SD=10).11

Reliability

Prior to data collection, interrater reliability for BOTMP scores was established using 6 children of varying abilities who were not part of the study but were within the age range of those in the study. The first author (JF) administered the BOTMP to 4 children while a therapist who used the BOTMP extensively watched and scored independently. The experienced BOTMP user administered the test to 2 children while the first author watched and scored independently. The authors have had extensive experience (7–24 years) in pediatrics, including administration and scoring of various motor tests. The intraclass correlation coefficient (2,1) for the total battery composite score was .97.

The interrater and test-retest scores stability on the TIMP over a 3-day period have been reported.12,13 The Facets computer program* for Rasch psychometric analysis23 was used to determine the scoring consistency for ratings by testers. A criterion of fewer than 5% misfitting ratings (ie, the number of unexpected ratings given the infant's level of ability and the item difficulty) was used for interrater reliability.24 The correlation between the scores over a 3-day period was .89.13

Procedure

The first author administered the BOTMP according to the standardized instructions in the test manual. The majority of children were tested in their homes. Because the subjects for this study were young children, a failure to cooperate was an issue in some cases. This issue was handled as recommended in the BOTMP manual,11 which provides the options of postponing the completion of the session until another day or administering items in a different order than that in the manual. Only 2 children were uncooperative at initial testing, and completion of the session was postponed for them. In both cases, the test was completed in 2 additional sessions that were scheduled within 8 days of the initial test, as recommended in the BOTMP manual. The test was scored according to the instructions in the manual. For purposes of scoring the BOTMP, each child's age was calculated from his or her date of birth. All children were tested at their homes, except for 3 children who were tested in a physical therapy classroom.

At the end of the session, the parent or guardian accompanying the child was asked to fill out a questionnaire with items for demographics and the child's medical and developmental history. This information was used to determine whether the infants had medical problems during the period between the TIMP and BOTMP testing that would put them at risk for developmental delays or disabilities. These medical problems would change the nature of the relationship between the results of the 2 tests, because the child might develop motor problems as a result of a new medical problem that did not exist at the time of TIMP testing.

The TIMP and POPRAS scores from the study by Campbell and associates12 were obtained after the administration and scoring of the BOTMP were completed. The first author, therefore, was unaware of the child's POPRAS and TIMP scores when administering the BOTMP.

Data Analysis

Sensitivity, specificity, and positive and negative predictive values were calculated to identify a TIMP cutoff score that might be used in a clinical setting to identify infants with current motor delays who were likely to have long-term motor delay or poor motor performance as assessed with the BOTMP. Poor motor performance was defined as a z score on the BOTMP of –1.5. This cutoff represents the middle range of scores that are considered "low performance" in the BOTMP manual.11 There is no recommended cutoff for the BOTMP composite score in the literature, except what is provided in the manual for the subtests.

Because the TIMP has not been normed, the z scores for the TIMP were first calculated using the means and standard deviations for each of the 7 age groups reported by Campbell et al.12 Because the TIMP z scores used for this analysis were not based on a random sample of the high-risk population, the results of this analysis must be considered preliminary. The receiver operating characteristic curves were used to explore various cutoffs on the TIMP for predicting BOTMP z scores of –1.5. The most correct classifications overall were obtained using a TIMP z score of –1.6. Given our small sample, confidence intervals (CIs) for the sensitivity, specificity, and positive and negative predictive values were calculated using the formula proposed by Fletcher et al.2 According to Fletcher and colleagues, point estimates of sensitivity and specificity values tend to be misleading when the sample is small. These authors recommended reporting the 95% CI for the range of values. Similarly, the diagnostic efficiency of the POPRAS was examined.

The relationship between TIMP scores (when adjusted for AGE) and BOTMP battery composite scores was examined using the Pearson product moment correlation coefficient (partial correlation). The TIMP total scores and BOTMP standard composite scores were used, with AGE controlled. It was necessary to control the effect of AGE because the TIMP yields a raw score, with no age norms, and because postconceptional age and TIMP scores are highly correlated (r=.83).12 A Pearson product moment correlation coefficient also was calculated to determine the relationship between the POPRAS and BOTMP battery composite scores.

A hierarchical multiple regression analysis was used to examine the amount of variance in the BOTMP score that was accounted for by the TIMP score (adjusted for AGE), beyond that accounted for by the POPRAS. Order of entry for the independent variables was: AGE, POPRAS score, and TIMP score. Although the POPRAS scores had been categorized in the original study (low risk=a score less than 60, medium risk=a score between 61 and 90, and high risk=a score greater than 91) based on medical complications, the scores were analyzed as continuous data in this analysis.


    Results
 Top
 Abstract
 Introduction
 Method
 Results
 Discussion
 Conclusion
 References
 
The mean BOTMP total battery composite score for the total sample was 43.9 (SD=11.6, range=20–65). The mean TIMP score for the total sample was 82.7 (SD=30.7, range=16–157). At school age, 8 children were classified as having motor delays by the BOTMP (23%): 1 with hemiplegia, 2 with spastic quadriplegia, 1 with Down syndrome, and 4 with developmental delays. The gestational age at birth was less than 29 weeks for 6 of the 8 infants with developmental delays.

Table 2 presents the sensitivity, specificity, and positive and negative predictive values using various cutoff scores for the TIMP and a z score on the BOTMP of –1.5 as the cutoff for differentiating normal versus abnormal motor outcomes. Using a cutoff z score of –1.6 for the TIMP, 4 true positive, 0 false positive, 27 true negative, and 4 false negative classifications were observed. These findings resulted in a sensitivity value of .50, a specificity value of 1.0, a positive predictive value of 1.0, and negative predictive value of .87 for the TIMP. Using the 95% CI, the sensitivity values ranged from .33 to .67, and the negative predictive values ranged from .76 to .98. Higher cutoff scores—for example, a z score of –.50—increased the sensitivity values (.75) but at the expense of a lower specificity value (.63) and a lower positive predictive value (.38) (Tab. 2). Overall, 89% of the children were correctly classified by the TIMP cutoff z score of –1.6.


View this table:
[in this window]
[in a new window]
Table 2. Summary of Sensitivity, Specificity, and Predictive Values of the Test of Infant Motor Performance (TIMP) in Predicting Performance on the Bruininks-Oseretsky Test of Motor Proficiency

 
Table 3 presents the sensitivity, specificity, and predictive values comparing the BOTMP z scores of lower than –1.5 at school age against the POPRAS scores. The cutoff score of 60 risk points on the POPRAS for predicting the school-age BOTMP yielded a sensitivity of 1.0, specificity of .44, positive predictive value of .35, and negative predictive value of 1.0. The cutoff of 60 risk points has been used to classify infants into low and moderate risk categories.12 The cutoff score that maximized the sensitivity and specificity of the POPRAS was 80 points. This cutoff point (and the 95% CI) resulted in a sensitivity of .88 (.77–.99), a specificity of .74 (.59–.89), and positive and negative predictive values of .50 (.33–.67) and .95 (.88–1.00), respectively. Overall, 75% of the infants were correctly classified by the POPRAS.


View this table:
[in this window]
[in a new window]
Table 3. Summary of Sensitivity, Specificity, and Predictive Values of the Problem-Oriented Perinatal Risk Assessment System (POPRAS) in Predicting Performance on the Bruininks-Oseretsky Test of Motor Proficiency

 
The partial correlation coefficient for TIMP scores and BOTMP total battery composite scores, with AGE controlled, was low but statistically significant (r=.36, P=.034). The Pearson product moment correlation coefficient between POPRAS and BOTMP total battery composite scores was higher and also statistically significant (r=–.55, P–.001). Table 4 presents the results of the hierarchical regression. Thirty-seven percent of the variance in BOTMP was explained by the combination of AGE, POPRAS, and TIMP. The POPRAS was a significant predictor variable (ß=–.54, P=.001). The TIMP did not reach the .05 level of significance when entered after AGE and POPRAS (ß=.46, P=.097).


View this table:
[in this window]
[in a new window]
Table 4. Results of Hierarchical Multiple Regressiona

 

    Discussion
 Top
 Abstract
 Introduction
 Method
 Results
 Discussion
 Conclusion
 References
 
Whether a developmental test should be used to diagnose infants who are likely to demonstrate typical or atypical motor performance at a later age depends on the purpose of the predictions and on the diagnostic values reported for the test, particularly the sensitivity and specificity values. Using a z score of –1.6 as a cutoff point, sensitivity of the TIMP was .50 and specificity was 1.00. The sensitivity of .50 represents a high rate of false negative results when the TIMP is administered to infants between 32 weeks gestational age and 13 weeks postterm age. Conceivably, the TIMP cutoff score may only be effective in identifying 50% of the infants who are likely to demonstrate atypical motor performance at school age when tested on the BOTMP (high false negative). Conversely, the specificity of 1.00 suggests that TIMP z scores below –1.6 at 32 weeks gestational age to 13 weeks postterm age will not falsely identify a motor delay at school age (low false positive). All of the children who were classified as having normal motor development on the BOTMP had been classified as having normal motor development on the TIMP. Overall, 89% of the children were correctly classified by the TIMP cutoff z score of –1.6.

A certain number of false negative classifications are expected with any test; however, the relatively high number of such classifications in this study is somewhat unusual because false positive classifications are the more typical problem with infant examinations.25,26 High false negative results are a concern to clinicians because they represent children with motor delays at school age who may not have received early intervention or periodic testing based on their early test scores. Therefore, to further explore the findings on sensitivity, each false negative case was inspected (Tab. 5). In particular, the age at which the TIMP was administered was examined to determine whether there was a particular age at which the TIMP score is likely to yield a false negative prediction. Second, we examined the questionnaires that the subjects' parents had completed to determine whether any of the children had notable injuries or illnesses after the administration of the TIMP that might have affected their later motor performance. Finally, the therapists' comments on the TIMP and on the BOTMP score forms were examined to determine whether any unusual occurrences were noted during testing. All 4 of the children with false negative results were born preterm (27–32 weeks gestational age at birth) and were classified as being at moderate to high risk for developmental delays based on their POPRAS scores. Two children were diagnosed with cerebral palsy and the other 2 were diagnosed with developmental delays, based on their scores on the BOTMP.


View this table:
[in this window]
[in a new window]
Table 5. False Negative Casesa

 
There was no evidence that any specific confounding factor systematically gave rise to the false negative classifications. Each of the children with false negative scores was from a different age group, suggesting that there is no one particular age at which the TIMP score is more likely to result in a false negative prediction. The 2 children with false negative classifications who were eventually diagnosed with cerebral palsy (children 1 and 4) had been administered the TIMP near term age (37 and 39 weeks postconceptional age). The findings by Ferrari et al27 may provide one plausible explanation for our findings.

Ferrari and associates27 examined the development of 29 infants who were at high risk for developmental delays from approximately 32 weeks postconceptional age to approximately 1 to 2 years of age. They reported that 9 of the infants whose previous neurological examination results were poor were considered developmentally normal around term age. The neurological examination results had been abnormal prior to term age, and were abnormal again several weeks later. The authors concluded that the transient normal neurological examination results were due to an interval of "normal" muscle tone (not defined by the authors) during the transition from hypotonia to hypertonia, which often occurs with abnormal motor development. Based on the observation by Ferrari and associates,27 future predictive studies using the TIMP may need to examine whether administration of the TIMP near term age may be more likely to yield a false negative classification.

Based on data from the parent questionnaires, child 1 also was reported to have had a condition that might have resulted in motor disability after the initial TIMP testing. This child had a history of epilepsy that, if not successfully controlled, might have contributed to a decline in motor performance after TIMP testing. Finally, based on comments made on the BOTMP, it is possible that the performance of child 2 on the BOTMP may have been affected by a poor attention span and hesitation in participating. The therapist noted that testing for this child had to be repeated in 3 sessions over a period of 3 days. There was no finding that we felt was consequential with child 3.

To interpret an individual child's test scores, information on the test's predictive values and the prevalence of the condition of interest is needed. The TIMP cutoff z score of –1.6 yielded excellent predictive values (ie, positive predictive values of .75–.80 are considered good28). The positive predictive value and negative predictive value were 1.00 and .87, respectively. These values suggest that, given similar prevalence rates for poor motor outcome, an infant with TIMP z scores of less than –1.6 (less than average performance) is 100% likely to demonstrate less than optimal motor proficiency at early school age, as measured by the BOTMP, whereas an infant with a TIMP score above the cutoff point has an 87% chance of showing optimal motor performance. Based on these findings, it appears that delayed motor performance, as measured by the TIMP, at this very early age in an infant's life may persist to early school age. Although positive predictive values are affected by the prevalence of the condition and are highly inflated if the prevalence is high,2 the 23% prevalence of developmental delay in our sample is consistent with (or, in some cases, lower than) that reported in the literature.29

The cutoff score of 80 provided the best diagnostic values for the POPRAS. The sensitivity and specificity were .88 and .74, respectively. Positive and negative predictive values were .50 and .95, respectively. The sensitivity of .88 suggests that the cutoff score of 80 on the POPRAS may be more effective than the TIMP cutoff score in identifying infants who are likely to experience motor delays later in life; however, the specificity values suggest that the POPRAS yields more false positive classifications than the TIMP. The results of the predictive values also indicate that half of the infants who are identified by the POPRAS as having abnormal motor development will actually have a normal motor outcome, whereas most of those identified as having normal motor development (95%) may have a normal motor outcome. The POPRAS may be more useful if the risk of overidentification of infants as having abnormal motor development is acceptable, but having some certainty that infants with motor problems are not being overlooked is necessary.

The moderate-to-good association between POPRAS and BOTMP scores observed in this study suggests that a child's early medical complications continue to influence motor performance at a later age. The simple correlation between the POPRAS and BOTMP was strong: –.55 (P=.001, R2=.30). We believe this finding is notable because the POPRAS was not designed for long-term prediction, but rather to predict medical risk for mortality in the perinatal and neonatal period.20 Scores on the POPRAS have been shown to be correlated with 1- and 5-minute Apgar scores30 as well as with primary cesarean section.31 The correlation between POPRAS score and later neuromotor outcome observed in our study is consistent with the findings of Campbell and Wilhelm.32 These authors reported that 60% (9 out of 15) of infants who were classified as being at high risk for developmental delay based on a combination of high POPRAS scores and other risk factors had major or minor neuromotor problems at 1 to 3 years of age.

Based on reports of their parents, 10 of the children in this study had received some type of motor therapy at some point during the interval between TIMP/POPRAS testing and BOTMP testing. The effect that this intervention might have had on the child's BOTMP score is unclear. The information on therapies received was based on parent recollection, and the reliability of the frequency, duration, and type of therapy that the child received is unknown.

Clinical Implications

A decision to use any one test for diagnostic purposes hinges on several factors: the psychometric properties of the test (including reliability, sensitivity, and specificity) and ease of administration of the test (including risk and cost factors). The TIMP appears to have good psychometric properties in terms of construct validity and reliability.12,13 Our findings suggest that the TIMP may not be useful if the clinician feels that it is very important to identify most of the children who have a poor neuromotor outcome, but is willing to accept overidentification of abnormal motor development in infants who will have normal motor outcomes. The TIMP, however, appears to be a useful tool if the clinician is interested in the reverse (ie, confidence that overidentification of infants as having abnormal motor development is minimized, but willing to accept the risk that some infants with problems will be overlooked).

According to Fletcher and associates,2 a test with high specificity is useful when false positive results can harm the patient physically, emotionally, or financially. Although unnecessary interventions are probably not physically harmful to the infant, they have an emotional and financial cost to families. Additionally, Fletcher et al recommend that sensitive tests be used for dangerous, treatable conditions. Although it would be ideal not to overlook an infant with motor problems during early infancy, most neuromotor problems that would be identified on the TIMP actually are not considered of immediate danger to the infant. However, because there is some evidence that early intervention may be effective, false negative results are always a concern. Infants may be monitored through early periodic screening and evaluation programs, which have become a standard practice in most hospitals with level-III nurseries.33 Furthermore, there is currently no other infant motor test that has reported predictive validity from preterm ages to school age.25,34,35

In our study, only the sensitivity value was higher for the POPRAS compared with the TIMP. The other diagnostic values were lower. The TIMP and POPRAS are very different from each other in terms of the items and administration of the test, which affects the ease of their administration. The TIMP is a clinical test of motor performance, whereas the POPRAS is a checklist of items from the prenatal, perinatal, and neonatal medical history in the infant's medical record. This information is usually available to clinicians working in an NICU, but would not normally be available to clinicians in a developmental follow-up clinic or early intervention (birth–3 years of age) programs. The POPRAS does not involve examination of the infant.

Interpretation of individual scores in a specific setting hinges on positive and negative predictive values, which are determined by the sensitivity, specificity, and prevalence of the disorder in the population being tested.2 Clinicians should be aware that the prevalence reported in this study (23%) included all children with BOTMP z scores lower than –1.5, not only children with specific neuromotor related diagnoses. Using a cutoff z score of –1.6 on the TIMP resulted in very high predictive values: 1.00 for the positive predictive value and .87 for the negative predictive value. Although the negative predictive value is good, clinicians basing a decision on the results of this study must make recommendations knowing that some infants with negative test results will have a poor neuromotor outcome. Because the TIMP is administered at a very young age, clinicians may decide to readminister the TIMP, particularly for infants at higher risk for developmental delays, such as those with a POPRAS score above 80 or a low gestational age. To help interpret individual scores, the range of scores, based on the 95% CI, have been presented. Confidence intervals are particularly important when the sample size is small, as is the case in our study.

Limitations

Three limitations of our study need to be considered before using the TIMP and the cutoff z score of –1.6 to identify infants who will have long-term problems with motor performance. The first limitation was the small number of subjects, despite the fact that the sample was randomly selected from infants who participated in the original TIMP study. Because of the small sample size, 95% CIs for the predictive values have been reported. Second, TIMP z scores were not calculated based on data from a random sample of the population of infants who are at high risk for developmental delays. Use of standard TIMP scores based on normative data would provide more insight regarding the predictive value of the tool. Third, there was very little documentation regarding environmental variables that could have affected motor development during the interval between TIMP and BOTMP administration.


    Conclusion
 Top
 Abstract
 Introduction
 Method
 Results
 Discussion
 Conclusion
 References
 
The predictive values we reported suggest that the TIMP is a viable tool for early identification of very young infants at risk for long-term poor motor performance. The low sensitivity suggests that the tool should be used only if some false negative results are acceptable to the clinician. Because of the high sensitivity values and a strong correlation between POPRAS and BOTMP scores, we recommend that, when feasible, the results of the 2 tests should be used together to make decisions regarding referrals for early intervention. To improve prediction, assessment of infants in the NICU may need to include not only their prenatal, perinatal, and postnatal history, but the use of a quantitative system such as the POPRAS as well.


    Footnotes
 
Both authors provided concept and research design, writing, data collection, and data analysis. Ms Flegel contributed project management. Suzann K Campbell contributed fund procurement, provision of subjects, institutional liaisons, and consultation. Glen Fujiura contributed to consultation. Karen Aukland contributed to data collection for reliability.

This study was approved by the University of Illinois at Chicago Institutional Review Board-Human Subjects Review Committee.

This work was supported, in part, by a grant from the Maternal and Child Health Bureau.

Subsequent to the submission and acceptance of this paper, Dr Kolobe became a partner in the Infant Motor Performance Scales, a provider of the Test of Infant Motor Performance.

This work was presented orally at the Combined Sections Meeting of the American Physical Therapy Association; February 2–6, 2000; New Orleans, La.

* MESA Press, 5835 Kimbark Ave, Chicago, IL 60637. Back


    References
 Top
 Abstract
 Introduction
 Method
 Results
 Discussion
 Conclusion
 References
 

  1. Girolami G, Campbell SK. Efficacy of a neuro-developmental treatment program to improve motor control in infants born prematurely. Pediatric Physical Therapy.1994; 6:175–184.
  2. Fletcher RH, Fletcher SW, Wagner EH. Clinical Epidemiology: The Essentials. Baltimore, Md: Williams & Wilkins;1996 .
  3. Aylward GP, Verhulst SJ. Predictive utility of the Bayley Infant Neurodevelopmental Screener (BINS) risk status classifications: clinical interpretation and application. Dev Med Child Neurol.2000; 42:25–31.[Web of Science][Medline]
  4. Wildin SR, Smith DE, Anderson AE, et al. Prediction of developmental patterns through 40 months from 6- and 12-month neurologic examinations in very low birth weight infants. Dev Behav Pediatr.1997; 18:215–221.[Web of Science][Medline]
  5. Morgan AM, Aldag JC. Early identification of cerebral palsy using a profile of abnormal motor patterns. Pediatrics.1996; 98:692–697.[Abstract/Free Full Text]
  6. Drillien CM. Abnormal neurologic signs in the first year of life in low-birthweight infants: possible prognostic significance. Dev Med Child Neurol.1972; 14:575–584.[Web of Science][Medline]
  7. Piper MC, Mazer B, Silver KM, Ramsay M. Resolution of neurological symptoms in high-risk infants during the first two years of life. Dev Med Child Neurol.1988; 30:26–35.[Web of Science][Medline]
  8. Coolman RB, Bennett FC, Sells CJ, et al. Neuromotor development of graduates of the neonatal intensive care unit: patterns encountered in the first two years of life. Dev Behav Pediatr.1985; 6:327–333.[Web of Science][Medline]
  9. Deitz J, Crowe T, Harris S. Relationship between infant neuromotor assessment and preschool motor measures. Phys Ther.1987; 67:14–17.[Abstract/Free Full Text]
  10. Harris SR. Early detection of cerebral palsy: sensitivity and specificity of two motor assessment tools. J Perinatol.1987; 7:11–15.[Medline]
  11. Bruininks RH. Bruininks-Oseretsky Test of Motor Proficiency Examiner's Manual. Circle Pines, Minn: American Guidance Service;1978 .
  12. Campbell SK, Kolobe THA, Osten ET, et al. Construct validity of the Test of Infant Motor Performance. Phys Ther.1995; 75:585–596.[Abstract/Free Full Text]
  13. Campbell SK. Test-retest reliability of the test of infant motor performance. Pediatric Physical Therapy.1999; 11:60–66.
  14. Campbell SK, Kolobe THA. Concurrent validity of the Test of Infant Motor Performance with the Alberta Infant Motor Scale. Pediatric Physical Therapy.2000; 12:1–8.
  15. Davidson EC, Hobel CJ. POPRAS: A Guide to Using the Prenatal, Intrapartum, Postpartum Record. Torrance, Calif: South Bay Regional Perinatal Project Professional Staff Association;1978 .
  16. Dubowitz LM, Dubowitz V, Palmer PG, et al. Correlation of neurologic assessment in the preterm newborn infant with outcome at 1 year. J Pediatr.1984; 105:452–456.[Web of Science][Medline]
  17. Cioni G, Prechtl HFR. Preterm and early postterm motor behaviour in low-risk premature infants. Early Hum Dev.1990; 23:159–191.[Web of Science][Medline]
  18. Wright BD, Masters GN. Rating Scale Analysis: Rasch Measurement. Chicago, Ill: MESA Press;1982 .
  19. Murney ME, Campbell SK. The ecological relevance of the Test of Infant Motor Performance elicited scale items. Phys Ther.1998; 78:479–489.[Abstract/Free Full Text]
  20. Hobel CJ, Hyvarinen MA, Okada DM, Oh W. Prenatal and intrapartum high-risk screening, I: prediction of the high-risk neonate. Am J Obstet Gynecol.1973; 117:1–9.[Web of Science][Medline]
  21. Ross MG, Hobel CJ, Bragonier JR, et al. A simplified risk-scoring system for prematurity. Am J Perinatol.1986; 3:339–344.[Medline]
  22. Linacre JM. FACETS: Computer Program for Many-Faceted Rasch Measurement. Chicago, Ill: MESA Press;1988 .
  23. Portney LG, Watkins MP. Foundations of Clinical Research. Norwalk, Conn: Appleton & Lange;1993 .
  24. Bond TG, Fox CM. Applying the Rasch Model: Fundamental Measurement in the Human Sciences. Mahwah, NJ: Lawrence Erlbaum Associates;2001 :106–118.
  25. Allen MC, Capute AJ. Neonatal neurodevelopmental examination as a predictor of neuromotor outcome in premature infants. Pediatrics.1989; 83:498–506.[Abstract/Free Full Text]
  26. Darrah J, Piper M, Watt M. Assessment of gross motor skills of at-risk infants: predictive validity of the Alberta Infant Motor Scale. Dev Med Child Neurol.1998; 40:485–491.[Web of Science][Medline]
  27. Ferrari F, Cioni G, Prechtl H. Qualitative changes of general movements in preterm infants with brain lesions. Early Hum Dev.1990; 23:193–231.[Web of Science][Medline]
  28. Mausner JS, Kramer S. Epidemiology: An Introductory Text. 2nd ed. Philadelphia, Pa: WB Saunders Co;1985 .
  29. Prechtl H, Einspeiler C, Cioni G, et al. An early marker for neurological deficits after perinatal brain lesions. Lancet.1997; 349:1361–1363.[Web of Science][Medline]
  30. Apgar V. A proposal for a new method of evaluation of the newborn infant. Anesthesia and Analgesia: Current Research.1953; 32:260–267.
  31. Molfese VJ, Thomson B. Optimality versus complications: assessing predictive values of perinatal scales. Child Dev.1985; 56:810–823.[Web of Science][Medline]
  32. Campbell SK, Wilhelm IJ. Development from birth to 3 years of age of 15 children at high risk for central nervous system dysfunction. Phys Ther.1985; 65:463–469.[Abstract/Free Full Text]
  33. Dworkin PH. British and American recommendations for developmental monitoring: the role of surveillance. Pediatrics.1989; 84:1000–1010.[Abstract/Free Full Text]
  34. Piper M, Darrah J, Pinnell L, et al. The consistency of sequential examinations in the early detection of neurological dysfunction. Phys Occup Ther Pediatr.1991; 11:27–44.
  35. Molteno CD, Thompson MC, Buccimazza SS, et al. Evaluation of the infant at risk for neurodevelopmental disability. S Afr Med J.1999; 89:1084–1087.[Web of Science][Medline]

Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Arch. Dis. Child.Home page
J. Cairney, S. Veldhuizen, P. Kurdyak, C. Missiuna, B. E Faught, and J. Hay
Evaluating the CSAPPA subscales as potential screening instruments for developmental coordination disorder
Arch. Dis. Child., November 1, 2007; 92(11): 987 - 991.
[Abstract] [Full Text] [PDF]


Home page
ptjournalHome page
T. H. Kolobe, M. Bulanda, and L. Susman
Predicting Motor Outcome at Preschool Age for Infants Tested at 7, 30, 60, and 90 Days After Term Age Using the Test of Infant Motor Performance
Physical Therapy, December 1, 2004; 84(12): 1144 - 1156.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Rapid Responses are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Flegel, J.
Right arrow Articles by HA Kolobe, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Flegel, J.
Right arrow Articles by HA Kolobe, T.
Related Collections
Right arrow Motor Development
Right arrow Tests and Measurements
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2002 by the American Physical Therapy Association.