Radiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online before print March 18, 2004, 10.1148/radiol.2312030886
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
2312030886v1
231/2/343    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Gilbert, F. J.
Right arrow Articles by Wardlaw, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gilbert, F. J.
Right arrow Articles by Wardlaw, D.
(Radiology 2004;231:343-351.)
© RSNA, 2004


Musculoskeletal Imaging

Low Back Pain: Influence of Early MR Imaging or CT on Treatment and Outcome—Multicenter Randomized Trial1

Fiona J. Gilbert, FRCR, Adrian M. Grant, DM, Maureen G. C. Gillan, PhD, Luke D. Vale, MSc, Marion K. Campbell, PhD, Neil W. Scott, MSc, David J. Knight, FRCS and Douglas Wardlaw, FRCS, For the Scottish Back Trial Group 2

1 From Depts of Radiology (F.J.G., M.G.C.G.), Health Services Research Unit (A.M.G., M.G.C.G., L.D.V., M.K.C.), Health Economics Research Unit (L.D.V.), Public Health (N.W.S.), and Orthopaedic Surgery (D.J.K., D.W.), University of Aberdeen, Foresterhill, Aberdeen AB25 2ZD, Scotland. Members of the Scottish Back Trial Group and their affiliations are listed at the end of this article. Received Jun 12, 2003; revision requested Aug 13; final revision received Oct 31; accepted Nov 25. Supported by NHS Research & Development Health Technology Assessment Programme. Health Services Research Unit and Health Economics Research Unit supported by Chief Scientist Office, Scottish Executive Health Dept. Address correspondence to F.J.G. (e-mail: f.j.gilbert@abdn.ac.uk).


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
PURPOSE: To establish whether early use of magnetic resonance (MR) imaging or computed tomography (CT) influences treatment and outcome of patients with low back pain (LBP) and whether it is cost-effective.

MATERIALS AND METHODS: In a multicenter randomized study, two imaging policies for LBP were compared in 782 participants with symptomatic lumbar spine disorders who were referred to orthopedists or neurosurgeons. Participants were randomly allocated to early (393 participants; mean age, 43.9 years; range, 16–82 years) or delayed selective (389 participants; mean age, 42.8 years; range, 14–82 years) imaging groups. Delayed selective imaging referred to imaging restricted to patients in whom a clear clinical need subsequently developed. Main outcome measures were Aberdeen Low Back Pain (ALBP) score, Short Form 36 (SF-36) score (for multidimensional health status), EuroQol (EQ-5D) score (for quality-adjusted life-year [QALY] estimates), and healthcare resource use at 8 and 24 months after randomization. Data were evaluated with analysis of covariance, ordinal logistic regression analysis, and {chi}2 and Mann-Whitney tests.

RESULTS: Both groups showed improvement in ALBP score, but this was greater in the early group (adjusted mean difference between groups, –3.05 points [95% CI: –5.16, –0.95; P = .005] and –3.62 points [95% CI: –5.92, –1.32; P = .002] at 8 and 24 months, respectively). Scores for SF-36 (bodily pain domain) and EQ-5D were also significantly better at 24 months. Clinical treatment was similar in both groups. Differences in total costs reflected cost of imaging. Imaging provided an adjusted mean additional QALY of 0.041 during 24 months at a mean incremental cost per QALY of $2,124.

CONCLUSION: Early use of imaging does not appear to affect treatment overall. Decisions about the use of imaging depend on judgments concerning whether the small observed improvement in outcome justifies additional cost.

© RSNA, 2004

Index terms: Cost-effectiveness • Efficacy study • Spine, CT, 33.1211 • Spine, MR, 33.1214


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Low back pain (LBP) is a common reason for health care consultations (1,2). Although most episodes are self-limiting, up to 20% of patients in the United Kingdom are referred for the opinion of a secondary care specialist (3). The majority of patients referred to secondary care specialists receive conservative treatment, and the role of imaging in treatment of these patients is uncertain, poorly defined, and variable (4). Some clinicians routinely request imaging to confirm their diagnosis and provide reassurance. Others limit use to patients who require interventional treatment or who have signs of potentially serious abnormalities or disease, for they argue that imaging could provide misleading information, generate unnecessary anxiety, and lead to inappropriate treatment (5).

It is unclear which of the diagnostic imaging pathways is most effective and cost-effective and how the imaging impacts on patient treatment. Thus, in the middle 1990s the place of sophisticated imaging in the treatment of LBP was identified as a research priority by the UK National Health Service (NHS) Research and Development Health Technology Assessment Programme (6).

The purpose of our study was to establish whether early use of magnetic resonance (MR) imaging or computed tomography (CT) influences treatment and outcome of patients with LBP and whether it is cost-effective. A pragmatic (7) multicenter study design was used to evaluate two imaging policies for LBP as might be applied in routine clinical practice settings in the UK NHS. Early imaging (implying liberal use of imaging) was compared with delayed selective imaging (implying use restricted to patients in whom a clear clinical need subsequently developed).


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Clinical Centers and Study Population
Patients were recruited between November 1996 and June 1999 from seven teaching hospitals and eight district general hospitals in the United Kingdom by 24 orthopedists and one neurosurgeon. Approval was given by the Scottish Multicenter Research Ethics Committee and the appropriate local research ethics committees, and all participants provided written informed consent. All the centers were part of the UK NHS, which is publicly funded through general taxation and provides health care that is generally free at the point of use. Privately funded health care exists, and its use was measured in the randomized trial; however, in the context of the NHS it would represent a minor component of the care provided.

All new patients who had symptomatic lumbar spine disorders at presentation were eligible for the trial if there was clinical uncertainty about the need for imaging (MR imaging or CT). Patients were excluded who required immediate referral for imaging (eg, those who had signs suggestive of serious abnormalities or disease ["red flags"] or who required surgical intervention), who had undergone MR imaging or CT of the spine in the previous 12 months, who did not need imaging (eg, those who were discharged to primary care), and who had pain of a nonspinal origin.

Principal Outcome Measures
A variety of outcome measures are now available to measure health outcomes for patients with LBP, and these include generic instruments that provide a summary of overall health and more sensitive condition- or disease-specific instruments that focus on LBP (8,9). The primary outcome measure was the Aberdeen Low Back Pain (ALBP) score. This condition-specific questionnaire allows assessment of LBP across several dimensions, including pain, physical impairment, and functional disability. It has been rigorously validated and shown to be more responsive to clinical change than is the Short Form 36 (SF-36) general health status questionnaire (10,11). Responses to the 19-item questionnaire are summed and converted to a percentage score (scores range from 0 for least disabled to 100 for most disabled). Other principal outcome measures were the SF-36, a generic instrument that is widely used and has been shown to be a reliable and valid instrument for the assessment of functional status (12), and the Euroqol (EQ-5D) (13), a generic instrument for measurement of health status that was specifically developed for the derivation of quality-adjusted life-years (QALYs). The SF-36 has eight subscales (physical and social functioning, physical and emotional role limitation, mental health, vitality, bodily pain, general health) that are each scored from 0 (poorest health) to 100 (best health). The EQ-5D includes five items (mobility, self-care, usual activities, pain or discomfort, and anxiety and depression) from which a single utility score may be derived (scores range from –0.59 for worst possible health state to +1 for best possible health state). In addition, measures of health service and participant resource use were also evaluated.

Data Collection
After we obtained informed consent, research nurses collected baseline clinical and demographic details, and participants completed health status questionnaires prior to random assignment to groups. At 8 and 24 months, health status measures and information about primary care consultation, purchase of prescription and nonprescription medicines, non-NHS treatment (eg, private physiotherapy and osteopathy), and time off work and discontinuation or interruption of usual activities because of LBP were collected with postal self-completion questionnaires. Telephone calls and reminder letters were used to increase the return of questionnaires.

The validity of information in regard to NHS resource use obtained from questionnaires was assessed by comparing patient recall with data abstracted from case notes for the first 100 patients recruited to the trial. The results of this comparison showed that although data may have been available from fewer people, questionnaires provided data that were broadly similar to information in hospital notes but covered items that were not recorded in hospital case notes, such as primary care physician consultations. Data about description of events that were available in case notes (eg, epidural injections) during the 24 months after trial entry were collected retrospectively from hospital case notes by trained researchers. These researchers were blinded to the original randomization allocation. Data were supplemented with information from the postal questionnaires. One researcher was responsible for data abstraction from all but one of the centers, and this researcher cross-checked the validity of the data abstracted by the researcher at the remaining center.

Randomization
Throughout the trial, we used a system of "distant" randomization with which we ensured complete separation of the randomization process from those providing care. For the first 66 (8%) patients, a system of simple randomization was used, and this was superseded by our automated system in which we incorporated minimization to ensure a balance in key prognostic variables. Assignment to the early imaging group (in which MR imaging or CT was performed as soon as was practicable) or the delayed selective imaging group (in which no MR imaging or CT was performed unless a clear clinical indication subsequently developed [eg, a decision to perform surgery]) was accomplished with the method of minimization (14). Hereafter, the "early imaging" and the "delayed selective imaging" groups will be termed "early" and "delayed selective" groups, respectively. Groups were stratified according to individual clinician with balancing for age (four age bands: <21, 21–40, 41–60, >60 years), sex, and diagnostic category. On the basis of clinical criteria, patients were assigned to one of five diagnostic categories: (a) symptomatic lumbar disk herniation, (b) root entrapment secondary to degenerative disease, (c) neurogenic claudication, (d) chronic LBP not caused by the previous three conditions, and (e) other LBP (spondylosis, spondylolisthesis, sacroiliitis, pathologic fracture, and osteoporosis).

Imaging modality and patient treatment plan were chosen at the discretion of the referring clinician.

Sample Size
The original aim was to recruit 1,200 participants to provide 90% power to identify a difference of 3.0 points in the ALBP score and 80% power to detect a difference of 2.5 points (P < .05, two-tailed test). Following lower than expected recruitment and reconsideration of the size of difference that would be clinically significant, the recruitment number was revised to 800 at the time of the data monitoring committee meeting. This provided 90% power to detect a difference of 3.7 points in the ALBP score and 80% power to detect a difference of 3.2 points (P < .05, two-tailed test).

Statistical Analysis of Noneconomic Data
Categoric variables were analyzed with the {chi}2 test, and continuous outcomes were evaluated with the Mann-Whitney test (where data were not normally distributed). For the ALBP score, the EQ-5D score, and scores for six of the subscales of SF-36, the primary method was analysis of covariance with adjustment for the factors used in the minimization method (age, sex, diagnostic category, and clinician) and the score at baseline. Except for the ordinal analyses, we also adjusted for the consultant and treated that adjustment as a random factor in the model. The data are reported as the adjusted mean scores and differences in means with 95% CIs. The three subscales of SF-36 with six or fewer possible responses were treated as ordinal outcomes, and ordinal logistic regression analysis with adjustment for the minimization factors and the score at baseline was used. All data were analyzed on an intention-to-treat basis. Preplanned secondary analyses were stratified according to diagnostic category, buttock or leg pain, duration of current episode, and trial center.

Economic Data and Statistical Analysis
Derivation of NHS costs.—Total average costs for the two trial policies were derived from estimated costs that were based on changes in treatment. The areas of treatment considered were related to hospital-based services (outpatient consultation; imaging; physiotherapy; hospital admission; surgery; injection; provision of back supports, corsets, or braces), primary care services (general practitioner visits, use of prescription and nonprescription medicines), and other tests (blood and urine tests) and devices. Unit costs were obtained from published sources (1517) or a costing exercise conducted as part of the trial in the six hospitals from which the majority of participants were recruited. Existing sources of cost data did not provide sufficient detail to allow an event, such as a surgical operation, to be costed out. In these situations, a "bottom-up" costing exercise was conducted in the six hospitals from which the majority of participants were recruited. Bottom-up costing involved identification of the staff, materials (both disposable and reusable), and relevant overhead costs (heat, power for imaging units, light, building costs) required to provide a procedure or test.

Staff costs were based on national salary scales plus additions for national insurance and superannuation (method of contribution to pension scheme). Missing questionnaire data were imputed primarily by using mean imputation, as the quantity of missing data for any given variable was small. Regression imputation was used for two variables identified as heavy cost drivers (number of primary physician care visits and of physiotherapy sessions). The cost per patient was the cost per event multiplied by the number of events. The UK treasury discount rate of 6% was applied to costs incurred in the 2nd year of follow-up and the amortized costs of reusable equipment. A cost per patient was estimated by dividing the equivalent annual cost by the number of people who would be expected to use the equipment in a year. Costs were derived in 2001–2002 UK pounds and converted into U.S. dollars with the exchange rate of U.S. $1.44 to UK £1.

Derivation of QALYs.—The QALYs were estimated with the standard EQ-5D UK tariff. The response health state tariff was calculated for each participant by using standard syntax (SPSS; EuroQol, York, England). Adjusted mean additional QALYs were calculated by using analysis of covariance and by adjusting for the factors used in minimization and determination of baseline EQ-5D scores (18).

Assessment of cost using utility.—With established economic methods, the incremental cost per QALY was estimated from the mean costs and effectiveness differences between the groups (19). Distribution-free methods were used for statistical inference with respect to costs, QALYs, and cost per QALY because of skewed distributions (20). This approach incorporates the uncertainty surrounding estimates of cost per patient caused by differences in health care resource use and the uncertainty surrounding QALY estimates. Bootstrap bias–corrected methods were used to estimate CIs for the difference in QALYs and the incremental cost per QALY.

The estimates of cost per QALY derived with the bootstrap method are presented in terms of a cost-effectiveness acceptability curve. The cost-effectiveness acceptability curve allows a decision-maker to consider whether the intervention is cost-effective in relation to some value he or she thinks is the maximum cost worth paying for a QALY. At each ceiling value for the willingness of society to pay for a QALY, the cost-effectiveness acceptability curve shows the probability that the treatment would be cost-effective.

Sensitivity analysis.—The main determinant of the differences in cost between early and delayed selective imaging was the cost of an image obtained with an MR imaging unit or CT scanner. The cost of imaging varied between the average observed cost of $129 and $720 or £500, which represents the upper estimate of the average UK cost of an MR image when all building and overhead costs are included. For illustrative purposes, the analysis was also repeated for an average cost of $432 or £300.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Characteristics of Groups at Trial Entry
A total of 2,657 patients were evaluated for trial eligibility. Of the 782 (29%) patients formally recruited, 393 were randomly allocated to the early group, and 389 were allocated to the delayed selective group. Figure 1 shows the progress of participants through the trial.



View larger version (65K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1. Diagram shows progress of participants through trial.

 
The clinical and demographic characteristics of participants in the two groups were similar at trial entry with respect to age, sex, and diagnostic category (Table 1). The duration of the current episode of LBP tended to be slightly shorter in the early group; there was also a baseline imbalance in health status, with patients in the early group having better scores with all three outcome measures (Table 2).


View this table:
[in this window]
[in a new window]

 
TABLE 1. Description of Groups at Trial Entry

 

View this table:
[in this window]
[in a new window]

 
TABLE 2. Quality of Life Outcome Measures at Trial Entry and at 8 and 24 Months

 
Impact on Treatment
At 24 months after trial entry, 353 (90%) of 393 patients in the early group and 115 (30%) of 389 in the delayed selective group had undergone imaging at a median time of 4.1 and 20.1 weeks, respectively (Table 3). A larger proportion of participants in the early group had a subsequent outpatient appointment (difference, 15.6%; 95% CI for difference: 9.7%, 21.5%; P < .001) (Table 3). However, consultation was less frequently used in the early group in the 2nd year. Overall, the total number of consultations was similar in the two groups (mean, 1.91 vs 1.88). In both groups, approximately 248 (60%) of 393 participants had received NHS-provided physiotherapy. There was no detectable difference in number of hospital admissions (7.9% [31 of 393] vs 6.7% [26 of 389]; difference, 1.2%; 95% CI: –2.4%, 4.9%), in number of surgical operations (6.9% [27 of 393] vs 5.1% [20 of 389]; difference, 1.7%; 95% CI: –1.6%, 5.1%), or in use of injections (17.8% [70 of 393] vs 19.3% [76 of 389]; difference, 1.5%; 95% CI: –7.2%, 3.7%).


View this table:
[in this window]
[in a new window]

 
TABLE 3. Imaging Data and Treatment Received

 
Impact on Health
Both groups showed improvement in scores from baseline at 8 months, with further improvement at 24 months (Table 2). At both time points, the scores were generally better in the early group. However, some of this difference reflected imbalance at baseline; moreover, with these comparisons, the effects of minimization at trial entry were not considered. After adjustment for age, sex, diagnostic category, center, and specialist, as well as baseline score, the difference between groups in the mean ALBP score was –3.05 (95% CI: –5.16, –0.95; P = .005) at 8 months and –3.62 (95% CI: –5.92, –1.32; P = .002) at 24 months.

For the SF-36 score, the clearest difference was in the bodily pain subscale score, for which the adjusted mean difference was 4.54 (95% CI: 1.23, 7.86; P = .007) at 8 months and 5.14 (95% CI: 1.61, 8.67; P = .004) at 24 months. For other subscales, the adjusted scores were generally better in the early group, and the differences were statistically significant at 8 months for social functioning, vitality, and reported health transition.

The EQ-5D score also showed improvement in the early group, compared with the delayed selective group, significantly so at 24 months (adjusted difference, 0.057; 95% CI: 0.013, 0.101; P = .01 [Table 2]). Secondary analyses stratified according to diagnostic category, the presence of buttock or leg pain at trial entry, duration of current episode, or the clinical center for recruitment of patients showed adjusted differences in means for the ALBP score at 24 months that were all statistically compatible with the overall result (Fig 2).



View larger version (25K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2. Graph shows subgroup analysis of ALBP score at 24 months after trial entry, with adjusted difference in ALBP score and 99% CIs for subgroups and 95% CIs for overall trial result. This analysis indicates that subgroup data were generally consistent with overall trial results (dotted vertical line), that is, greater improvement in health was observed in the early group. Solid vertical line denotes no difference between the groups.

 
Economic Evaluation
Table 4 summarizes the cost per patient in the two groups. The mean total cost per patient for the early group was $701 (median, $434) and for the delayed selective group was $614 (median, $329). The mean QALYs per patient during 2 years were 1.114 (median, 1.250) for the early group and 1.027 (median, 1.130) for the delayed selective group. Adjustment for the imbalance in EQ-5D scores at baseline resulted in a difference of 0.041 QALYs (95% CI: –0.006, 0.087). The difference in cost was $87 (95% CI: –$41, $209). On average, the early imaging policy provided one additional QALY at an estimated cost of $2,124, which was calculated as follows: ($701 – $614)/0.041.


View this table:
[in this window]
[in a new window]

 
TABLE 4. Total Cost per Patient and Cost for Each Area of Resource Use in Each Group

 
There was an approximately 9% chance that the early imaging policy was both more effective and less costly than the delayed selective imaging policy, a 4% chance that it was less costly but less effective, and less than a 0.5% chance that it was more costly and less effective. Overall, there was a 94.9% probability that the incremental cost per QALY would be less than $50,000 (Fig 3). These estimates are most sensitive to changes in the cost of imaging. If the average cost is $720 rather than $130, the equivalent probability is 90%. Judgments about cost-effectiveness depend on the worth attached to a QALY. For example, if society were willing to pay $10,000 or $5,000 for each QALY gained, then there would be an 89% or a 77% chance, respectively, that early imaging would be cost-effective.



View larger version (24K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3. Graph shows cost-effectiveness acceptability curve. Image illustrates the impact on the incremental cost per QALY of changes in the cost of imaging. As the cost of imaging increases, the likelihood that an early imaging strategy is cost-effective decreases.

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Participants in both groups reported an improvement in health status during the 24 months after trial entry, but there was a significantly greater improvement in participants in the early group at both 8 and 24 months when the ALBP score, the bodily pain subscale score of the SF-36, and the EQ-5D score were considered. The estimated mean incremental cost per QALY was $2,124. However, the differences in outcome were all small.

There were no significant differences between the two groups in overall clinical treatment, except for the use of imaging and the timing of outpatient appointments. Also, the pattern of changes in ALBP scores was more consistent, with small improvements in a large number of participants rather than a large improvement in a few participants. These findings suggest that imaging may have a small direct effect, perhaps through reassurance, rather than an indirect effect, through changes in treatment in a minority of patients. This observation is consistent with the findings of a study (21) about the influence of imaging on clinical decision making that showed greater clinician confidence in the diagnosis in the early group; this confidence might have been transferred to the patient. Improvement in patient psychologic well-being and satisfaction has been reported in two small trials with radiography for patients with LBP, but no differences in patient outcome were observed (22,23). In addition, investigators in a study (24) of a randomized comparison of radiography and MR imaging for patients with LBP reported similar patient outcomes in both groups. Results of that study indicated that there was no measurable improvement in functional status or health-related quality of life; however, the MR imaging group had greater self-reported reassurance and satisfaction.

In this study, participants were allocated to groups by using the method of minimization. Although this is essentially a nonrandom method, we included a random element in the allocation process to ensure that those who were recruiting participants could never have predicted the allocation with certainty. We also adjusted for the minimization factors in the analysis, as failure to do so could have resulted in overconservative results (14). Although the groups were similar in regard to the minimization factors of age, sex, specialist, and diagnostic category, there were more participants with a shorter duration of episode in the early group. Furthermore, there were also differences in the baseline score of all three outcome measures, with the early group reporting better scores. Nevertheless, the differences in outcome persisted after adjustment for baseline differences. In the interpretation of results, one should also consider possible imprecision in the estimates. On the basis of the 95% CIs, the true difference (after baseline adjustment) in ALBP score at 24 months is likely to be between –1.32 and –5.92. The decision about whether or not any effect is clinically and economically important may be substantially influenced by whether the true effect is at the upper or lower end of this range.

Although we did not expect any effects of early imaging to be large because of the chain of events between imaging and the time of follow-up assessment of health status (25), the clinical relevance of the small observed change that benefited the early group is difficult to assess. Garratt and colleagues (11) compared the responsiveness of health status questionnaires and reported a change of approximately 7.5 points in the ALBP score for patients in the control group who said they were better at 1 year. Similarly, these patients recorded a score change of 0.14 points with the EQ-5D score. In this context, our observed score difference of –3.62 points in the ALBP score and 0.057 in the EQ-5D score may not represent meaningful changes to patients. In a comparison of the estimates of effect derived from different instruments, such as the UK Back Pain Exercise and Manipulation Trial (Garratt AM, personal communication, 2001), patients who reported "no change," "slightly improved," or "much improved" had differences in ALBP scores between the ratings of –3.25 and –5.29.

With our own data, we compared the score for the reported health transition subscale of the SF-36 with the ALBP score at 24 months. The majority of patients in our study reported the status as "somewhat worse," "about the same," and "somewhat better," and the differences in ALBP scores between these strata were –4.24 and –10.28. However, it should be noted that the reported health transition subscale of the SF-36 relates to general health and that patient responses to the question for this subscale will be influenced by changes in not only back pain but also other health problems. Results of a comparison of SF-36 score changes with subjective reports of improvement in a group of patients with sciatica suggested that a seven-point difference in the SF-36 physical functioning and bodily pain subscales was consistent with a clinically important difference (26). Although this finding suggests that our observed difference of 4.54 points (95% CI: 1.23, 7.86) in the bodily pain subscale score may not be clinically important, caution is required in generalizing results to all categories of LBP for which the prognosis and natural history may differ (27).

Since the patients in this study had baseline scores for SF-36 and EQ-5D that were lower (ie, poorer health) than scores reported in two studies (22,23) of patients with LBP referred by primary care physicians for plain radiography, our data may only be generalizable to the patients with more severe disability who are referred to secondary care specialists.

In the secondary stratified analyses, estimates of effect were similar in the various clinical categories and consistent with the overall trial results, and there was no clear evidence of a larger or smaller effect in any subgroup of participants. In relation to duration of symptoms, the pattern of changes in outcome scores did not follow the expectation that the improvement would be greatest among those with pain for less than 3 months. Instead, the largest difference in the mean score was in the group of participants who had pain between 3 and 12 months.

Although there was a statistically significant difference in EQ-5D scores at 24 months, this did not equate to a difference in QALYs that was statistically significant at 24 months. The reason for this is that the QALY represents cumulative quality-adjusted survival at 24 months and as such incorporates the 8-month EQ-5D data. Nevertheless, the evidence suggests that there is more than a 95% probability that early imaging is associated with greater QALYs at 24-month follow-up. Furthermore, the economic evaluation indicated that there is a 9% probability that imaging is both less costly and more effective and approximately a 95% chance that the incremental cost per QALY is less than $50,000. These estimates are, however, sensitive to changes in the cost of imaging, the main cost driver.

With respect to study limitations, there was a baseline imbalance in the ALBP, SF-36, and EQ-5D scores, with the early group reporting better health status. In addition, at presentation, the episode of back pain tended to be longer in the delayed selective group. Furthermore, patients with a shorter duration of back pain may be expected to have better outcome (28,29). Nevertheless, differences in outcome persisted after adjustment for baseline values, and the values included in the study were obtained after adjustment. At trial entry, it was intended that patients complete the baseline questionnaire prior to notification of their randomization allocation. However, if patients were notified of their allocation while they were in the process of completing the questionnaire, the knowledge of being referred for imaging might have affected their self-reporting of health status (30).

Since baseline scores for the SF-36 and EQ-5D were lower (ie, they indicated poorer health) than scores reported in studies (2224) involving patients with LBP in primary care settings, our data may only be generalizable to patients with more severe disability who are referred to secondary care specialists. Furthermore, differences in waiting times and access to secondary care and imaging services could influence the generalizability of the study results. In the economic evaluation, biases may have arisen in the participants’ recall of health service use and in regard to missing data. However, the concordance between the alternative data sources was high; because of the efforts made to ensure that questionnaires were returned and to assess all hospital records, the total quantity of missing data was relatively low.

Decisions about the use of sophisticated imaging will depend to an important extent on judgments about the value of the observed differences in outcome and whether they are worth the extra costs of early imaging. The use of MR imaging does not appear to affect treatment overall, and the small observed improvement in outcome is of questionable clinical importance. Although some researchers may argue that any improvement is worthwhile, given that the other costs of treatment do not appear to be increased, others may say that the cost of providing a small improvement in patients’ overall well-being is not justifiable, especially when there are competing demands for MR imaging resources.


    ACKNOWLEDGMENTS
 
Andrew M. Garratt, PhD, University of Oxford, England, contributed to the interpretation of the outcome scores. Richard W. Porter, FRCS, helped develop the original grant proposal. We are indebted to the clinicians and nurses at the collaborating centers for their assistance in participant recruitment.


    FOOTNOTES
 
2The Scottish Back Trial Group: Clinical collaborators: Douglas Wardlaw, FRCS, David J. Knight, FRCS(Ortho), George P. Ashcroft, FRCS(Ortho), Rabindra N. Bhattacharrya, MBBS, James D. Hutchison, FRCS, George F. Kaar, FRCS, Francis W. Smith, FRCR, Olive J. Robb, FRCR, Fiona J. Gilbert, FRCR (Aberdeen Royal Infirmary and Woodend Hospital, Aberdeen); Neil W. Valentine, FRCS, John G. Houston, FRCR, John A. Tainsh, FRCR (Ninewells Hospital, Dundee; Stracathro Hospital, Brechin; Arbroath Infirmary); David Anderson, FRCS, Johannis G. Kilian, FRCS, Marthinus F. Roos, FRCS, John Addison, FRCR (Dr Gray’s Hospital, Elgin); Anthony T. Reece, FRCS, Nigel D. Raby, FRCR (Western Infirmary, Glasgow); Eugene F. Wheelwright, FRCS, John Shand, FRCR (Stobhill Hospital, Glasgow); Alastair W. Forrester, FRCR (Glasgow Royal Infirmary); Alberto Gregori, FRCS(Orth) (Stonehouse Hospital and Hairmyres Hospital, East Kilbride); David M. Bell, FRCS, Alexander C. Campbell, FRCS, Kenneth Wallers, FRCR (Monklands Hospital, Airdrie); Klas E. R. Buring, MD, Josephine E. Barry, FRCR (Falkirk Royal Infirmary); James P. M. Campbell, MD (Princess Margaret Rose Orthopaedic Hospital, Edinburgh); I. Beggs, FRCR (Royal Infirmary of Edinburgh); Bhu D. Sharma, FRCS (Law Hospital, Carluke); Peter L. Sanderson, FRCS, Michael J. Gibson, FRCS (Newcastle General Infirmary); David Aaron, FRCS, Mary J. Stevenson, FRCR (Royal Alexandra Hospital, Paisley). Data coordinators: J. Longhurst, S. Campbell, S. McLeer, J. E. Andrew (Health Services Research Unit, University of Aberdeen); M. L. Muirhead (Department of Radiology, University of Aberdeen); E. McIntosh (Health Economics Research Centre, University of Oxford); D. Arnold (Department of Orthopaedic Surgery, Ninewells Hospital, Dundee). Back

The views expressed in this article are those of the Scottish Back Trial Group.

Abbreviations: ALBP = Aberdeen Low Back Pain, EQ-5D = EuroQol, LBP = low back pain, NHS = National Health Service, QALY = quality-adjusted life-year, SF-36 = Short Form 36

Author contributions: Guarantor of integrity of entire study, F.J.G.; study concepts, F.J.G., A.M.G., D.W.; study design, F.J.G., A.M.G., M.G.C.G., M.K.C., D.W., D.J.K.; literature research, F.J.G., M.G.C.G.; clinical studies, F.J.G., D.W., D.J.K.; data acquisition, M.G.C.G., N.W.S., L.D.V.; data analysis/interpretation, L.D.V., N.W.S.; statistical analysis, N.W.S., L.D.V.; manuscript preparation, M.G.C.G., L.D.V., F.J.G., A.M.G., N.W.S., D.W., D.J.K.; manuscript definition of intellectual content, M.G.C.G., L.D.V., F.J.G., A.M.G.; manuscript editing, F.J.G., A.M.G., M.K.C.; manuscript revision/review, F.J.G., A.M.G., L.D.V., N.W.S., M.K.C.; manuscript final version approval, all authors


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Andersson GB. Epidemiological features of chronic low-back pain. Lancet 1999; 354:581-585.[CrossRef][Medline]
  2. Clinical Standards Advisory Group. Back pain London, England: Her Majesty’s Stationery Office, 1994.
  3. Croft P, Papageorgiou A, McNally R. Low back pain. In: Stevens A, Raftery J, eds. Health care needs assessment, series 2. Oxford, England: Radcliffe Medical, 1997; 145-181.
  4. Elkayam O, Avrahami E, Yaron M. The lack of prognostic value of computerized tomography imaging examinations in patients with chronic non-progressive back pain. Rheumatol Int 1996; 16:19-21.[CrossRef][Medline]
  5. Deyo RA. Magnetic resonance imaging of the lumbar spine: terrific test or tar baby? N Engl J Med 1994; 331:115-116.[Free Full Text]
  6. Health Technology Assessment Advisory Group. Assessing the effects of health technologies London, England: Department of Health, 1992.
  7. Roland M, Torgerson DJ. Understanding controlled trials: what are pragmatic trials? BMJ 1998; 316:285-286.[Free Full Text]
  8. Kopec JA. Measuring functional outcomes in persons with back pain: a review of back-specific questionnaires. Spine 2000; 25:3110-3114.[CrossRef][Medline]
  9. Taylor SJ, Taylor AE, Foy MA, Fogg AJ. Responsiveness of common outcome measures for patients with low back pain. Spine 1999; 24:1805-1812.[CrossRef][Medline]
  10. Ruta D, Garratt AM, Wardlaw D, Russell IT. Developing a valid and reliable measure of health outcome for patients with low back pain. Spine 1994; 19:1887-1896.[Medline]
  11. Garratt AM, Klaber Moffett J, Farrin AJ. Responsiveness of generic and specific measures of health outcome in low back pain. Spine 2001; 26:71-77.[CrossRef][Medline]
  12. Ware JE. SF-36 health survey update. Spine 2000; 25:3130-3139.[CrossRef][Medline]
  13. Tosteson AN. Preference-based health outcome measures in low back pain. Spine 2000; 25:3161-3166.[CrossRef][Medline]
  14. Scott NW, McPherson GC, Ramsay CR, Campbell MK. The method of minimization for allocation to clinical trials: a review. Control Clin Trials 2002; 23:662-674.[CrossRef][Medline]
  15. British National Formulary Number 40, 2001 London, England: British Medical Association and Royal Pharmaceutical Society of Great Britain, 2001.
  16. Netten A, Curtis I. Unit costs of health and social care Kent, England: : Personal Social Services Research Unit, University of Kent, 2000.
  17. Scottish Health Service Costs 2000–01 Edinburgh, Scotland: Information and Statistics Division, 2001.
  18. Manca A, Richardson G, Hawkins N, Sculpher M. A note on the estimation, the reporting and the use of patient-level QALYs in economic evaluation alongside randomised controlled trials. Presented at the Health Economists’ Study Group Meeting, University of Leeds, England January 8–10, 2003.
  19. Briggs AH, Gray AM. Handling uncertainty when performing economic evaluation of healthcare interventions. Health Technol Assess 1999; 3:1-134.[Medline]
  20. Barber JA, Thompson SG. Analysis and interpretation of cost data in randomised controlled trials: review of published studies. BMJ 1998; 317:1195-1200.[Abstract/Free Full Text]
  21. Gillan MG, Gilbert FJ, Andrew JE, et al. Influence of imaging on clinical decision making in the treatment of lower back pain. Radiology 2001; 220:393-399.[Abstract/Free Full Text]
  22. Kerry S, Hilton S, Patel S, Dundas D, Rink E, Lord J. Routine referral for radiography of patients presenting with low back pain: is patients’ outcome influenced by GPs’ referral for plain radiography? Health Technol Assess 2000; 4(20):i–iv-1–119.
  23. Kendrick D, Fielding K, Bentley E, Kerslake R, Miller P, Pringle M. Radiography of the lumbar spine in primary care patients with low back pain: randomised controlled trial. BMJ 2001; 322:400-405.[Abstract/Free Full Text]
  24. Jarvik JG, Hollingworth W, Martin B, et al. Rapid magnetic resonance imaging vs radiographs for patients with low back pain. JAMA 2003; 289:2810-2818.[Abstract/Free Full Text]
  25. Hollingworth W, Mackenzie R, Todd CJ, Dixon AK. Measuring changes in quality of life following magnetic resonance imaging of the knee: SF-36, EuroQol or Rosser index? Qual Life Res 1995; 4:325-334.[CrossRef][Medline]
  26. Patrick DL, Deyo RA, Atlas SJ, Singer DE, Chapin A, Keller RB. Assessing health-related quality of life in patients with sciatica. Spine 1995; 20:1899-1909.[Medline]
  27. Saal JA, Saal JS. Intradiscal electrothermal treatment for chronic discogenic low back pain: a prospective outcome study with minimum 1-year follow-up. Spine 2000; 25:2622-2627.[CrossRef][Medline]
  28. Waddell G. Low back pain: a twentieth century health care enigma. Spine 1996; 21:2820-2825.[CrossRef][Medline]
  29. Von Korff M, Deyo RA, Cherkin DC, Barlow W. Back pain in primary care: outcomes at 1 year. Spine 1993; 18:855-862.[Medline]
  30. Sox HC, Jr, Margulies I, Sox C. Psychologically mediated effects of diagnostic tests. Ann Intern Med 1981; 95:680-685.



This article has been cited by other articles:


Home page
J Bone Joint Surg BrHome page
K.-A. Jansson, G. Nemeth, F. Granath, B. Jonsson, and P. Blomqvist
Health-related quality of life (EQ-5D) before and one year after surgery for lumbar spinal stenosis
J Bone Joint Surg Br, February 1, 2009; 91-B(2): 210 - 216.
[Abstract] [Full Text] [PDF]


Home page
J Am Board Fam MedHome page
R. A. Deyo, S. K. Mirza, J. A. Turner, and B. I. Martin
Overtreating Chronic Back Pain: Time to Back Off?
J Am Board Fam Med, January 1, 2009; 22(1): 62 - 68.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Neuroradiol.Home page
L.M. Ash, M.T. Modic, N.A. Obuchowski, J.S. Ross, M.N. Brant-Zawadzki, and P.N. Grooff
Effects of Diagnostic Information, Per Se, on Patient Outcomes in Acute Radiculopathy and Low Back Pain
AJNR Am. J. Neuroradiol., June 1, 2008; 29(6): 1098 - 1103.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
R. Ouwendijk, M. de Vries, T. Stijnen, P. M. T. Pattynama, M. R. H. M. van Sambeek, J. Buth, A. V. Tielbeek, D. A. van der Vliet, L. J. SchutzeKool, P. J. E. H. M. Kitslaar, et al.
Multicenter Randomized Controlled Trial of the Costs and Effects of Noninvasive Diagnostic Imaging in Patients with Peripheral Arterial Disease: The DIPAD Trial
Am. J. Roentgenol., May 1, 2008; 190(5): 1349 - 1357.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
R. Chou, A. Qaseem, V. Snow, D. Casey, J. T. Cross Jr, P. Shekelle, D. K. Owens, and for the Clinical Efficacy Assessment Subcommittee
Diagnosis and Treatment of Low Back Pain: A Joint Clinical Practice Guideline from the American College of Physicians and the American Pain Society
Ann Intern Med, October 2, 2007; 147(7): 478 - 491.
[Abstract] [Full Text] [PDF]


Home page
Clin TrialsHome page
S. M Kymes, K. Lee, J. W Fletcher, and SNAP (CSP 027) Study Group
Assessing diagnostic accuracy and the clinical value of positron emission tomography imaging in patients with solitary pulmonary nodules (SNAP)
Clinical Trials, February 1, 2006; 3(1): 31 - 42.
[Abstract] [PDF]


Home page
ImagingHome page
J Teh, A Imam, and C Watts
Imaging of back pain
Imaging, December 1, 2005; 17(3): 171 - 207.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
R. Ouwendijk, M. de Vries, P. M. T. Pattynama, M. R. H. M. van Sambeek, M. W. de Haan, T. Stijnen, J. M. A. van Engelshoven, and M. G. M. Hunink
Imaging Peripheral Arterial Disease: A Randomized Controlled Trial Comparing Contrast-enhanced MR Angiography and Multi-Detector Row CT Angiography
Radiology, September 1, 2005; 236(3): 1094 - 1103.
[Abstract] [Full Text] [PDF]


Home page
NEJMHome page
E. J. Carragee
Persistent Low Back Pain
N. Engl. J. Med., May 5, 2005; 352(18): 1891 - 1898.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
2312030886v1
231/2/343    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Gilbert, F. J.
Right arrow Articles by Wardlaw, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gilbert, F. J.
Right arrow Articles by Wardlaw, D.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
RADIOLOGY RADIOGRAPHICS RSNA JOURNALS ONLINE