|
|
||||||||
Editorials |
1 From the Department of Radiology, the University of Virginia Health System, PO Box 800170, Charlottesville, VA 22908 (B.J.H.); and Center for Statistical Sciences, Program in Public Health, Brown University, Providence, RI (C.A.G.). Received December 21, 2007; final version accepted February 21, 2008. Address correspondence to B.J.H. (e-mail: bjh8a{at}virginia.edu).
Discuss this article at www.rsna.org/radiology/discuss
When is the right time to conduct a clinical trial of a diagnostic imaging technology? It is an important question. The practice of radiology has grown in both scientific and economic influence almost solely because of the dramatic technologic advancements of the past 4 decades. The continuation of the innovation-evaluation-dissemination process is essential to the future robustness of the specialty. In this continuum, evaluation—the process of technology assessment—is playing an increasingly important role as employers and payers demand more definitive evidence of the value of a technology in improving care before granting reimbursement. Without reimbursement, a technology will not be successful.
Unfortunately, there is no simple answer to the question, "when is the best time?" The answer depends on a multiplicity of factors, including the nature of the technology, its intended application, the stage of development and diffusion into practice, from whose perspective the technology is being viewed, and what we will broadly refer to as nonscientific issues. This editorial will address these influences in an effort to provide readers with a better understanding of the thinking involved in clinical trial decision making.
The first step is to define what is meant by a clinical trial of a diagnostic imaging technology. For purposes of this presentation, an imaging clinical trial is any research involving the use of a diagnostic imaging technology for a clinical application on living humans. Thus, we will not consider, for example, the imaging of animals or human specimens or the preclinical testing of devices for their spatial, contrast, or temporal resolution. What we will consider is the evaluation of devices, biologic agents, and pharmaceuticals for the three major applications of diagnostic imaging to clinical practice: screening and early detection of disease, diagnosis and staging of disease, and employment of imaging as a prognostic and/or predictive marker of treatment response.
| THE DICTATES OF TECHNOLOGY |
|---|
|
|
|---|
The situation is more complex with imaging devices. There is no universally established phased approach. In addition, the level of evidence demanded by the FDA for approval of most new devices tends to be lower than it is for new biologics and pharmaceuticals. As a result, the assessment of new imaging devices tends to be more diffuse and haphazard than it is for new molecules.
Fryback and Thornbury (2) propounded a hierarchical scheme for considering the evaluation of new medical devices in ascending order of meaningfulness to the value of the technology to clinical care. This scheme follows: diagnostic accuracy (accuracy end points such as sensitivity, specificity, and area under the receiver operating characteristic curve), diagnostic and therapeutic thinking efficacy (discerning the impact on referring physicians employing end points that are often based on surveys of physicians' considerations), and health outcomes and costs (example end points include mortality, changes in quality of life, and cost per quality-adjusted life-years). Gatsonis (3) proposed an approach to the clinical evaluation of diagnostic modalities that parallels the phase I–IV study system for therapy. The approach can be presented in a matrix format (Table), with columns corresponding to the "developmental age" of the modality and rows corresponding to different aspects of the "value" of the modality. The developmental age of the modality can be categorized as one of four phases, with technology assessment targets as follows: phase I (discovery), establishment of technical parameters and diagnostic criteria; phase II (introductory), early quantification of performance in clinical settings; phase III (mature), comparison with other modalities in large, prospective, multi-institutional studies (efficacy); and phase IV (disseminated), assessment of the procedure as utilized in the community at large (effectiveness).
|
Most radiology research and publication focuses on the lower rungs of these hierarchies, where small studies can be carried out at single institutions and paired designs requiring smaller sample sizes can be employed. Trials that address the impact of imaging modalities on patient outcomes and cost usually require randomization; larger sample sizes; much greater expense; and multi-institutional, multidisciplinary coordination to enforce the rigor required.
These factors have been daunting barriers to conducting more advanced trials. Nonetheless, while it makes little practical sense to conduct a trial to determine the impact on patient outcomes for a new technology that is still developing and changing very rapidly and for which the actual benefits and costs of use in general practice are not known, once the accuracy of a test is established, we should probably be better than we have been as a specialty about moving on to developing trials that better determine what society would really like to know about our innovations—the extent to which employing imaging contributes to improving health and at what cost. To some extent, the methodological, logistic, and financial barriers to pursuing phase III clinical trials for suitable technologies have been overcome by the development of a federally funded imaging clinical trials cooperative group, the American College of Radiology Imaging Network (ACRIN). However, even here there is a limitation of what can be done with the available funding and, because of the National Cancer Institute (NCI) funding source, ACRIN has so far only addressed imaging as it applies to cancer (4).
| PERSPECTIVE MATTERS |
|---|
|
|
|---|
Although payers often claim that their interest in making decisions to reimburse for a technology is related solely to the credibility of evidence supporting clinical use, payers clearly have financial incentives to stave off paying for expensive imaging technologies that may have broad and frequent applications. One would think that payers, having such an important financial stake, would be eager to support and participate in designing and implementing rigorous trials that would definitively inform their reimbursement decision making. In fact, the opposite has usually been the case. Payers mostly maintain that physicians (interestingly, not patients) are the beneficiaries of new technology and should underwrite the cost of evaluating innovations. In recent years, payers have begun to insist that a technology show the capacity to improve patients' health (5). This bar often is difficult for imaging technologies to cross. Imaging tends to be a node in a chain of diagnostic and therapeutic encounters, the sum of which is expected to positively affect outcome. It is often difficult to parse out the role imaging plays in achieving a successful result.
One would think, then, that, because radiologists would benefit intellectually and financially from the advent of and payment for a new imaging technology, our incentives to conduct a trial would be congruent with those of manufacturers—the earlier the better. However, there are pitfalls in this conceit. Aside from the barriers to conducting rigorous, generalizable research already discussed, there may be concern that a negative result from too early a trial may doom an otherwise valuable technology, or at the very least, set it back some years. It is quite a quandary in today's payment environment. Experience has shown that traditional single-institutional radiology research is likely to portray a new technology more positively than it would ultimately appear in multicenter studies or than it would work in general community practice. The generalizability of the results from the small, typically single, institutional studies has always been limited, and such studies are no longer sufficient to promote reimbursement for a modality. Positive results from more rigorous, more generalizable multi-institutional trials are more likely to hasten positive payment decisions. However, such a result is less likely until a technology has sufficiently matured. In addition, because medical imaging already is highly effective, to a very real extent, each new technology faces a stiffer task in demonstrating its superiority than the last.
| APPROACHES TO TRIALS DECISION MAKING |
|---|
|
|
|---|
There also is a need for guidance. There are just too many new technologies and novel applications of existing technologies to submit all of them to clinical trials. As a result, methods have been developed to assess the "value of information" that would be obtained from conducting a formal clinical trial and when might be the most propitious time to do so. In an often-cited article, Phelps and Mushlin (7) developed a decision theoretic framework for determining the expected value of diagnostic information. They proposed a two-step approach to decisions about studies of new diagnostic imaging technology. In the first step ("hurdle 1"), the expected value of perfect information is estimated. If this value is less than the cost of using the new technology, any further testing of the technology does not seem justifiable. If a modality overcomes the first hurdle, clinical studies of the diagnostic accuracy of the modality would be performed and used to estimate the expected value of the (imperfect) information that will be provided by the modality in practice. The second step ("hurdle 2") would be overcome if the expected value of the imperfect information gleaned from employing the technology exceeds its cost.
The decision theoretic model of Phelps and Mushlin (7) can also be used to decide on whether to conduct comparative studies of modalities. Specifically, if the receiver operating characteristic curve of the conventionally employed diagnostic modality is available, a "challenge region" can be computed that shows the range of sensitivity and specificity pairs that would be required for the new modality to be sufficiently cost effective in order to represent an improvement. If the challenge region consists of values of sensitivity and specificity that seem to be out of the range of possibilities for the new modality, then a comparative study may not be justifiable.
These decision analytic approaches to the question of what trials should be conducted are conceptually attractive but rarely implemented in practice. Nonetheless, their logic is valuable in thinking about imaging evaluation even if the formal modeling analysis is not performed.
| HOW DECISIONS TO CONDUCT CLINICAL TRIALS ARE MADE |
|---|
|
|
|---|
Many of these influences are exemplified in the deliberations leading up to the conduct of the Digital Mammographic Imaging Screening Trial of ACRIN (9). A more complete description of this example of how scientific, technical, methodological, and regulatory influences impinge on clinical trial decision making is available at http://radiology.rsnajnls.org/cgi/content/full/248/1/12/DC1.
The FDA made an early determination that full-field digital mammography posed a sufficient risk of increased false-positive diagnoses beyond what was experienced in screen-film mammography that it posed the threat of increasing the negative biopsy rate (Susan Alpert, PhD, MD, FDA, director of the Office of Device Evaluation, to Morgan Nields, written communication, February 9, 1999). A 1996 FDA Guidance required the submission of data showing the equivalence of full-field digital mammographic and screen-film mammographic interpretations for companies to market their full-field digital mammographic devices. However, the FDA failed a 1999 filing by GE Healthcare, Milwaukee, Wis, by using this approach and then revoked the 1996 Guidance and upgraded full-field digital mammography to class III. They further recommended that future filings address the accuracy of full-field digital mammographic interpretations. This action imposed a higher barrier of proof of the clinical efficacy of full-field digital mammography, as well as greater expense on the companies developing the technologies. In a 1999 letter to Daniel Sullivan, the associate director of the Diagnostic Imaging Program (now the Cancer Imaging Program), at the NCI, Robert Brittain of the National Electrical Manufacturers Association expressed the concern that the manufacturers were already overinvested and that the type of trial being demanded by the FDA was too expensive for any single manufacturer.
There developed a coordinated advocacy effort on behalf of digital mammography before Congress and the NCI. The resulting pressure on NCI was telling. Members of the NCI called together the key stakeholders, including the imaging device companies, the FDA, the Centers for Medicare and Medicaid Services, and the principal investigator of a proposed ACRIN trial of full-field digital mammographic screening, Etta Pisano, MD. That ACRIN existed provided the scientific, logistic, informatics, and accrual capacity to make feasible a trial of the size and level of sophistication demanded by the FDA. NCI Director Richard Klausner, MD, supported the concept for a trial that emerged from the stakeholders' meeting and eventually provided $27 million in general grant funds to ACRIN to pursue the trial. The Digital Mammographic Imaging Screening Trial opened for accrual in 2000, accrued nearly 50 000 women on schedule, and published its primary results that younger and perimenopausal women and women with dense breasts benefit from increased accuracy of interpretations attributable to full-field digital mammography in 2005 (10).
The foregoing example demonstrates that there are critical issues beyond the virtues of the technology and the importance of its application that must be accommodated if a trial is to go forward and when that occurs. The following factors all have an important impact on whether a clinical trial proceeds: the availability of a funding mechanism and research infrastructure; the relative effectiveness of proponents of a trial; the concerns of manufacturers, providers, advocacy groups, and patients; the perceived potential of an innovation to improve health; and political pressures (11).
| CONCLUSION |
|---|
|
|
|---|
Discuss this article at www.rsna.org/radiology/discuss
| FOOTNOTES |
|---|
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |