Radiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


DOI: 10.1148/radiol.2352040330
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Gazelle, G. S.
Right arrow Articles by Beinfeld, M. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gazelle, G. S.
Right arrow Articles by Beinfeld, M. T.
(Radiology 2005;235:361-370.)
© RSNA, 2005


Special Reviews

Cost-effectiveness Analysis in the Assessment of Diagnostic Imaging Technologies1

G. Scott Gazelle, MD, MPH, PhD, Pamela M. McMahon, BS, Uwe Siebert, MD, MPH, MSc and Molly T. Beinfeld, MPH

1 From the Institute for Technology Assessment and Department of Radiology, Massachusetts General Hospital, Harvard Medical School, 101 Merrimac St, 10th Floor, Boston, MA 02114-4724 (G.S.G., P.M.M., U.S., M.T.B.); Department of Health Policy and Management, Harvard School of Public Health, Boston, Mass (G.S.G., U.S.); and PhD Program in Health Policy, Harvard University, Boston, Mass (P.M.M.). Received February 19, 2004; revision requested April 27; revision received May 4; accepted May 24. Address correspondence to G.S.G. (e-mail: scott@mgh-ita.org).


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
In many ways, diagnostic technologies differ from therapeutic medical technologies. Perhaps most important, diagnostic technologies do not generally directly affect long-term patient outcomes. Instead, the results of diagnostic tests can influence the care of patients; in that way, diagnostic tests may affect long-term outcomes. Because of this, the benefits associated with the use of a specific diagnostic technology will depend on the performance characteristics (eg, sensitivity and specificity) of the test, as well as other factors, such as prevalence of disease and effectiveness of available treatments for the disease in question. The fact that diagnostic tests affect short-term, or "surrogate," outcomes, rather than long-term patient outcomes makes evaluation of these tests more complicated than the evaluation of therapeutic technologies. This article will trace the history of technology assessment in medicine, address the role of cost-effectiveness and decision analysis in health technology assessment, and describe unique features and approaches to assessing diagnostic technologies. The article will then conclude with a consideration of the limits of medical technology assessment.

© RSNA, 2005


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
The explosive growth in medical technology and procedures (13) during the past several decades has provided physicians with an unparalleled ability to diagnose abnormalities and treat patients. In many cases, however, new technologies and procedures have diffused widely without careful assessment of their appropriate roles in the care of patients. The uncertainty around the effectiveness of a diagnostic technology can result in controversy; examples of this include the use of computed tomography (CT) in lung cancer screening and mammography in women aged 40–49 years (412). The increase in available medical technologies may also have contributed to the nearly continuous increase in health care spending since the 1960s (13), the burden of which has fallen increasingly on the public sector. For instance, 45.9% of national health expenditures in the United States in 2002 were from federal or state funds (14). Heightened awareness of these issues has focused attention on constraining the growth of health care costs. Thus, as both our needs and our capabilities continue to grow in the face of constrained resources, we are increasingly faced with the dilemma of choosing either the best tests and procedures or the most cost-effective tests and procedures from among worthy alternatives, rather than merely determining which tests are effective and which are not. In particular, we are confronted with the task of identifying population subgroups for which a diagnostic procedure is clinically beneficial and cost-effective, so that clinical algorithms can be used to consider individual patient characteristics.

As we progress further into the current "era of assessment and accountability" (15), more people agree that we must improve and expand our technology assessment techniques and efforts (1620). Multiple journals are now devoted to technology assessment; however, neither technology assessment nor its specific application to health care is a new activity. Physicians and others who provide or pay for health care have long been engaged in informal technology assessment with attempts to better understand the effects and relative merits of medical interventions. Clinical research, however, has improved considerably, and we now have tools to gather data and compare the efficacies of new and existing technologies. For many of these assessments, the randomized controlled trial (RCT) is the tool of choice.

RCTs, described as "the crown jewel of traditional technology assessment" (21), have traditionally been used to evaluate the clinical effects of new technologies or procedures. When properly designed and conducted, RCTs can provide useful data for decision makers. RCTs will always be a critically important component of technology assessment activities because, if conducted appropriately, they are less prone to systematic bias. However, RCTs may be inappropriate or impractical for answering many of the questions now faced by health care decision makers. In addition to the long-appreciated difficulty of translating the observed efficacy of a technology from an RCT into real-world effectiveness, evaluating the costs of therapies may also be difficult in the context of an RCT. For example, the technologies being evaluated may be new enough that pricing does not reflect the costs that might be seen in a competitive marketplace. Clinical trials are best suited for detection of differences in treatment effect, as reflected in variables such as objective response rates or survival. In addition, the trials are generally performed with ideal circumstances by expert clinicians who practice in the best hospitals. Thus, prospective cost assessments, if performed, may not be able to be generalized to routine health care and may be even less applicable to other physicians, clinical settings, countries, or health care systems. Learning curve effects may also falsely increase the apparent cost of the new technologies.

Conducting RCTs is also expensive and time-consuming: The average phase III trial is estimated to cost $86 million (in 2000 U.S. dollars) (22). As a result, the trials are generally limited to fairly short time horizons and limited sample sizes. Thus, it may be impossible to draw conclusions about narrowly defined groups of patients and include all conditions for which the procedures might be performed. Furthermore, randomized trials are not particularly well suited for use in the evaluation of diagnostic imaging tests. Withholding a noninvasive imaging test that might provide useful diagnostic information creates difficult ethical dilemmas. RCTs also cannot be used to examine the myriad of possible combinations of tests or threshold values that might be performed in a given clinical situation, nor can they be used to predict the usefulness of a test, should treatments for the condition improve.

Diagnostic technologies differ in many ways from therapeutic medical technologies. Most important, diagnostic technologies do not generally directly affect long-term patient outcomes. Instead, the results of diagnostic tests can influence the care of patients, and in that way, they may affect long-term outcomes. Because of this, the benefits associated with the use of a specific diagnostic technology will depend on the performance characteristics (eg, sensitivity and specificity) of the test and on other factors, such as the prevalence of disease and the effectiveness of available treatments for the disease in question. The fact that diagnostic tests affect short-term, or "surrogate," outcomes rather than long-term patient outcomes makes the evaluation of these tests more complicated than the evaluation of therapeutic technologies.

This article will trace the history of technology assessment in medicine, address the role of cost-effectiveness and decision analysis in health technology assessment, and describe unique features and approaches in the assessment of diagnostic technologies. We will then conclude with a consideration of the limits of medical technology assessment.


    Brief Review of American Medical Technology Assessment Since 1960
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
The phrase "technology assessment" first entered into public policy debates during the 1960s and 1970s, principally regarding issues such as nuclear waste disposal and efforts to reduce the health risks from exposure to environmental toxins. During this period, studies by the National Academy of Sciences (23), National Academy of Engineering (24), and National Academy of Public Administration (25) created the foundation on which Congress established the Office of Technology Assessment in 1972 (26). This was followed in 1974 by the establishment of a Health Program within the Office of Technology Assessment; soon thereafter, the growth of technology assessment in medicine began to escalate.

In the 1970s, technology assessment could include evaluations of any or all of the following end points: feasibility, safety, efficacy, appropriateness, cost, or cost-effectiveness. Technology assessment in medicine, however, has focused primarily on technical and clinical end points, such as feasibility, safety, and efficacy, despite a growing recognition of the need to broaden the scope of the investigations. Additionally, and in somewhat of a contrast to the technology assessment activities in other fields (eg, environmental health), medical technology assessment was focused more on primary rather than secondary effects of the technology being considered (27).

Throughout the 1970s and 1980s, the continued development of expensive medical technologies, such as computed tomography (CT), may have provided a powerful stimulus for broadened technology assessment in medicine (2835). These new technologies offered substantial improvements in diagnosis, treatment, or both, when compared with the preceding technology, but the improvements often came with an expensive price tag. Several other developments also helped to catalyze technology assessment as a major component of medical research. Prominent among these was the creation by Congress of the National Center for Health Care Technology (NCHCT) in 1978 (36). The NCHCT was intended to become the principal federal technology assessment agency (27). It was located, along with the National Center for Health Services Research and the National Center for Health Statistics, in the Office of the Assistant Secretary of Health, and the NCHCT was responsible for analyzing emerging and existing medical technologies. The NCHCT was also responsible for advising the Health Care Financing Administration on issues related to Medicare coverage.

In many ways, the origins of a concerted technology assessment research enterprise in this country can be traced to the NCHCT (27). Powerful organizations, such as the American Medical Association and the Health Industry Manufacturers Association, expressed opposition to its formation, however, and they lobbied against its continued support. Ultimately, the combination of their opposition and a changing political landscape prevailed. Funding for the NCHCT was terminated in 1981, and the NCHCT ceased to exist shortly thereafter.

Though the demise of the NCHCT was a setback for federally supported technology assessment research in medicine, it did not signal the end. In 1984, the National Center for Health Services Research was expanded and renamed the National Center for Health Services Research and Health Care Technology Assessment, and in 1986, the Institute of Medicine of the National Academy of Sciences established the Council on Health Care Technology. During this period, the scope of medical technology assessment continued to expand, and end points relating to quality of care, patient well-being, functional health status, and costs were featured more prominently in the national research agenda (19,20,27,3742). As a result, assessments of diagnostic imaging technologies, and specifically their effects on patient management and outcomes, began to appear with increasing frequency (3035,37,38,43).

The government was not alone in supporting technology assessment in medicine. Private sector involvement also began to expand. For example, Blue Cross and Blue Shield established its own technology evaluation program, which developed specific criteria for use in the assessment of medical technologies (44). Notably, Blue Cross and Blue Shield required evidence that a net positive effect on health outcomes could be attained in general practice, which often differs from clinical trial settings. The American Medical Association, despite—or perhaps because of—its prior opposition to the NCHCT, also entered the fray, establishing its own Diagnostic and Therapeutic Technology Assessment program (45).

Medical technology assessment received its next major boost with the Omnibus Budget Reconciliation Act of 1989, which resulted in a substantial increase in funding to support medical technology assessment and paved the way for the establishment of the Agency for Health Care Policy and Research (now the Agency for Health Care Research and Quality). This agency subsumed the activities and responsibilities of the National Center for Health Services Research. It was specifically charged with developing research databases, facilitating the dissemination of research results, developing practice guidelines, and evaluating medical practice in general. The Agency for Health Care Policy and Research (now the Agency for Health Care Research and Quality) continued to play an important role in medical technology assessment throughout the 1990s and into the 21st century.

The establishment of such high-profile agencies has spawned a host of smaller agencies and organizations and has served to increase support for broadening the scope of medical research. As described previously, medical technology assessment can include a global assessment of the effect of medical practice on patient well-being (19,21) by including issues such as functional well-being (4648), quality of life (49,50), and patient preferences (51,52).

As medical researchers began to broaden the scope of their assessments, they realized that methods necessary for expanded technology assessments did not yet exist, were not fully developed, or belonged to an entirely different group of scientists than those who had traditionally assessed the safety and efficacy of medical interventions. New skills and approaches were needed, and new scientists had to be recruited to participate in the studies. Not only did the new technology assessments require physicians and biostatisticians, they also required epidemiologists, economists, quality of life experts, decision scientists, and health policy experts. Other challenges also had to be faced. The new technology assessments required access to more and different data sets than previous studies needed. Along with the increased demand for data came the need for improved methods to process, analyze, and present these data in an accessible format. Along with the desire to better understand the trade off between cost and efficacy that is faced when choosing between competing technologies (in the situation of restricted resources) came the need to develop and implement formal methods of cost-effectiveness analysis (CEA) appropriate for evaluation of health care technologies.

To better understand the state of medical technology assessment and to promote further methodologic development in the assessment of the economic value of health technologies, the U.S. Public Health Service in 1993 convened the Panel on Cost-Effectiveness in Health and Medicine. The formation of this panel was motivated by a recognition of the importance of CEA and technology assessment in health and medicine and a concern for the tremendous variations in the methods and quality of currently available analyses. This group of 13 scientists and scholars (none of whom were government employees) with expertise in the techniques and applications of CEA was charged with assessing the current state of the science and making consensus-based recommendations that might improve methods and promote the transparency of research and comparability of results. The group met 11 times over a period of 21/2 years and produced a book and several articles dealing with critical issues in CEA (5357). The focus of the panel was on policy decisions and resource allocation at a broad level, with particular attention paid to reaching consensus regarding the conduct of CEAs and proposing steps that should be taken to address the remaining unresolved issues. Principal among the panel’s recommendations was the importance of including a reference case analysis in all CEAs. The reference case analysis permits results of different CEAs to be compared and allows decision makers to better understand the implications of their resource allocation decisions when funding is directed to one program or technology rather than another.


    CEA in Medical Technology Assessment
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
Medical care now consumes more than 14% of the gross domestic product in the United States, which is more than any other nation in the world (14); by 2012, it could reach 17.7% (58). Furthermore, the rate at which spending increases each year has begun to rise recently after a temporary pause (13,14). Despite the ever-expanding array of expensive new technologies, it is clear that the resources allocated to health care can no longer be allowed to expand without limit. Heightened awareness of these issues has resulted in increased pressure on medical researchers and policy makers to develop and apply new analytic methods. The hope has been that if we could only understand the trade offs between clinical benefits and costs, we might be able to make more informed decisions and rein in the spiraling health care costs.

CEA is a method used to evaluate health outcomes and costs of different medical technologies and procedures relative to one another (53,55,59). It is a tool that analysts and decision makers can use to compare competing options and select those that best meet their needs within budget constraints. CEA is used to evaluate technologies and procedures of interest through the use of the (incremental) cost-effectiveness ratio. In this ratio, changes in resource use ("costs") relative to a relevant alternative are summed in the numerator, while changes in health effects are summed in the denominator. Thus, the denominator includes productivity costs of morbidity and functional limitation. The most commonly used measures of health outcome are measures of survival, such as the number of lives or life-years saved; however, quality of life may be incorporated into the analysis by using measures such as quality-adjusted life-years (49,5963) or healthy-year equivalents (50). In addition, adjustments may be made for the timing of future benefits or costs by discounting these in an appropriate manner (6467).

When cost-effectiveness analytic techniques are used, it is possible to compare different health interventions, taking into account both cost and effectiveness (68). The resulting incremental cost-effectiveness ratios indicate the cost of each additional unit of health outcome (eg, quality-adjusted life-years) that one might wish to "purchase" by investing in health care interventions that are more expensive, more effective, or both. Strategies with lower cost-effectiveness ratios are considered to be more cost-effective than strategies with higher cost-effectiveness ratios.

Analysis generally proceeds in the following manner: First, we determine the best estimates of the parameters of interest for the analysis to perform so-called base-case analysis. Then, we estimate the uncertainty surrounding these parameter estimates to guide sensitivity analysis, where models are reanalyzed as certain key parameters are varied throughout a reasonable range of values. The particular parameters selected and the range through which they are varied is guided by knowledge regarding the range of reasonable estimates for these parameters, suspicion that their variation may substantially affect results, or a desire to explore the effect that potential changes in diagnostic accuracy, treatment effectiveness, or cost might have on predicted results.

In the evaluation of medical interventions, CEA use is based on the premise that decision makers would like to maximize health outcomes at any given level of spending (59,69). CEA helps to define the opportunity cost of selecting one intervention rather than another. Different options are compared by using comparable measures of cost and outcome, and the resulting incremental cost-effectiveness ratios can be used to determine the cost of each additional unit of health outcome. By funding programs in order of increasing incremental cost-effectiveness ratio (59), it is possible to obtain the maximum health benefits given a fixed budget constraint. CEA, as seen in this manner, is a tool with which to optimize resource allocation to programs that compete for funds from the same limited budget.

CEA has been used to examine a variety of medical technologies, procedures, and programs. A review of these analyses, or the scope of the possible applications of CEA in medical technology assessment, is beyond the scope of this article. We direct interested readers to an online registry of CEAs maintained by the Harvard Center for Risk Analysis (www.hsph.harvard.edu/cearegistry/).

The information provided by CEAs can be used in many different settings. It can be used to guide policy making at the societal level, program implementation at the local or state level, purchasing decisions at the hospital or health system level, coverage decisions by third-party payers, or pricing and marketing decisions by pharmaceutical and device manufacturers.

Why, then, is CEA not universally used to determine how we spend all of our health care dollars? To begin with, cost-effectiveness is not the only important criterion in making decisions concerning resource allocation, nor should it be. Many additional factors are important when setting funding priorities. Simple mechanical optimization of limited resources by using cost-effectiveness ratios ignores important issues such as distributive justice, equity, and benefits and costs outside of the health care system (53). It also may not permit appropriate comparison of interventions that affect different subgroups of the entire population. For example, questions have been raised about comparing interventions that affect young people with those that affect old people, those that affect wealthy people with those that affect poor people, or those that affect healthy people with those that affect unhealthy people. In each case, cost-effectiveness ratios may not be the most appropriate metric with which to compare interventions.


    The Role of Decision Analysis
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
Decision analysis is an analytic approach that can be used to guide decision making when conditions of uncertainty exist (7075). It allows the analyst to integrate all available information and take into account any underlying uncertainty or complex sequencing of events and decisions. Decision analysis has been used in a wide variety of settings, from stock market investment to cancer chemotherapy. It is particularly well suited for medical interventions, however, because of the complexity of clinical decisions and because virtually all clinical decisions are made in the face of some uncertainty. Moreover, decision analysis has been performed in different areas of medicine, such as primary prevention (risk factor reduction), secondary prevention, screening, diagnostic procedures, therapy, and rehabilitation (76). Several situations in which decision-analytic modeling is helpful or even required have been described elsewhere (77). In most of these situations, decision models are used for combining or linking data from different research areas and sources or transferring or extrapolating results from one time, place, population, or setting to another (77).

Decision analysis proceeds in a stepwise fashion. First, the problem to be addressed is identified and bounded. Next, the problem is structured, and the information that will be needed to analyze it is identified. Finally, the optimal course of action, based on the available data and analysis performed, is determined (72). Often there is no single best decision, or the answer is highly dependent on the particular assumptions of the decision model. In either case, the optimal decision must be qualified; however, if no single best choice can be determined, decision analysis can be a valuable tool in the identification of critical factors and their influence on the optimal choice. It can also be used to guide prospective data collection and efficiently narrow the bounds of uncertainty surrounding those variables or assumptions that have the greatest effect on the ultimate decision (78,79).

The use of decision analysis to guide complex resource allocation decisions has one important advantage: It forces individuals to be explicit about the factors that influence their judgments and decisions. The transparency provides a tool with which to assess the soundness of the judgments made based on the available data and to better understand the effect that changing assumptions might have on optimal choices. When disagreements exist, it is often possible to move toward consensus by being explicit about each component of the decision (eg, data, beliefs, event probabilities) and resolving disagreements about specific components one at a time rather than focusing on the decision at a macro level.


    Unique Features of Diagnostic Technology Assessment
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
Assessment of diagnostic technologies differs from the evaluation of medical therapeutics in many respects. One of the most important and challenging differences is the indirect relationship between the results of a diagnostic test and the actual health outcome in patients. Diagnostic test results are intermediate outcomes; they influence, but do not directly determine, health outcomes in patients. A number of factors can alter the usual chain of events and prevent the patient from receiving the intended treatment. For example, the physician may misinterpret the patient’s symptoms at presentation and order the wrong diagnostic test. The diagnostic test may be technically inadequate, or it may be technically adequate, but the specialist may err when interpreting the study. The correct interpretation may not be accurately transmitted to the referring physician, or the referring physician may select the wrong treatment. The patient may elect not to comply with the recommendations of the physician. Primarily depending on the severity of the disease and the effectiveness of the treatment used, the patient’s health may improve, stay the same, or deteriorate.

Although imaging procedures share the generic features of all diagnostic tests, there are several issues that are peculiar to imaging tests. First, the test results are often multidimensional (eg, presence or absence, location, size, form, and constitution of a tumor) rather than one-dimensional, as is the level of a tumor marker in a blood test. Second, clear cut points are rarely established; thus, test results must often be summarized in terms of likelihoods, such as "very unlikely," "unlikely," "likely," and "very likely," instead of in terms of categories, such as "test positive" and "test negative." Third, images can reveal signs for different diseases, further adding to the complexity of the decision making process. Fourth, imaging techniques may be associated with the risk of radiation-induced side effects, leading to a clinical trade off between benefit and harm. Fifth, image quality increases with improved resolution; thus, the results of diagnostic studies are often outdated when devices with better image quality emerge. Sixth, many emerging imaging tests are expensive. Finally, an important feature for the evaluation of a diagnostic procedure is that images can be assessed at different times and by different readers, which allows us to analyze intra- and interobserver agreement.

Although more attention has recently been focused on diagnostic technology assessment, assessing the efficacy and effect of diagnostic imaging has been a concern of radiologists and policy makers for many years. Pioneering work in this field was performed by Ledley and Lusted (80), Thornbury et al (81), Fineberg and Hiatt (19), Fineberg (34), Fineberg and colleagues (35), Abrams and McNeil (30,31,43), McNeil and colleagues (37), McNeil and Adelstein (82), and other researchers. During the past two decades, methodologic and applied research on diagnostic technology assessment has blossomed, and many important issues have been addressed (8390).

Abrams and McNeil (43) suggested that the complexity of the relationship between diagnostic test results and actual health outcomes has led to the use of more accessible "process" measures. The literature is replete with studies that have been performed to assess tests that use parameters such as "lesion conspicuity," number of correct diagnoses made, and number of unanticipated findings. Even more reports have simply described the frequency with which a particular finding is associated with a disease or group of diseases or reported the spectrum of findings associated with one disease or another. Relatively fewer reports have estimated the effect of imaging technologies on final health outcomes at either the patient or the societal level.


    A Hierarchical Approach to Assessment of Diagnostic Efficacy
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
Against the backdrop of the introduction of CT, Fineberg et al (35) proposed a hierarchical conceptual framework for use in the evaluation and comparison of diagnostic imaging tests, elements of which had already been described in the literature (81,82). The approach of Fineberg et al (35), later adopted and modified by others (9194), includes sequential evaluation of the following six levels of efficacy: (a) technology, (b) diagnostic accuracy, (c) diagnostic thinking; (d) therapeutic planning; (e) patient outcomes, and (f) society. Efficacy at higher levels is contingent on efficacy at all lower levels, but improvements at any given level do not necessarily translate into improvements at higher levels. Furthermore, a test with fixed efficacy at one level (N) may have different efficacies at higher levels (N + 1) in different clinical situations.

Level 1, or "technical efficacy," generally falls within the domain of physicists and engineers who develop and refine an imaging technology before its clinical implementation and testing. Technical efficacy is usually judged by using parameters that can be precisely measured in a laboratory with optimal conditions, such as spatial resolution, quantum mottle, necessary exposure time, and radiation dose.

Level 2, or "diagnostic accuracy efficacy," of a binary test is expressed by using sensitivity and specificity, positive and negative predictive values, or receiver operating characteristic (ROC) curves. The two most common measures of test performance are sensitivity, TP/(TP + FN), and specificity, TN/(TN + FP), where TP is the number of true-positive results, FN is the number of false-negative results, TN is the number of true-negative results, and FP is the number of false-positive results; these can be calculated when a reliable reference standard is available. Biases may result, however, when the reference standard is either imperfect or not uniformly applied (described in more detail later) or when test results are uninterpretable, distorted by measurement bias, or caused by differences in the "case mix" between study populations (9597).

Often, the performance of new tests is evaluated by using existing diagnostic methods as a reference standard. If these existing diagnostic methods are imperfect (ie, "tarnished" reference standard), however, it may be impossible to determine whether what appears to be low specificity (ie, many false-positive results) represents truly poor specificity or improved sensitivity, relative to the reference standard. That is, the new test may enable the detection of abnormalities that were undetectable by using the standard of reference; thus, the results of the new test are incorrectly classified as false-positive interpretations. Attempts to improve the standard of reference by using subsequently obtained consensus readings (generally including all available diagnostic information) may yield biased estimates, particularly if the consensus panel treats a positive result with either test as indicative of true disease.

Additional problems may arise when the reference standard is not uniformly applied to all patients, and the probability of verification depends on the diagnosis rendered by the diagnostic test being evaluated (ie, "verification bias"). It is not uncommon for patients to undergo confirmatory tests only (or preferentially) when the initial diagnostic test result indicates the presence of disease. Methods have been proposed to correct for verification bias (96,97), but these have been infrequently applied, and they can only be used if at least a fraction of the patients have been assessed with the reference standard (98).

ROC analysis (99103) has roots in signal detection theory. ROC analysis addresses the critical role of the individual who interprets the imaging study, and it is used explicitly to evaluate the threshold for test positivity (ie, positivity criterion); as the threshold is varied, there is a necessary trade off between sensitivity and specificity. ROC analysis allows one to compare the diagnostic value of different tests as a function of the positivity criterion (ie, before choosing such a criterion). Given sufficient data on sensitivity and specificity pairs for a particular technology, meta-analytic techniques can yield a summary ROC curve (104,105), which adjusts for different positivity criteria and provides an overall estimate of test performance. A disadvantage is that ROC analysis is only appropriate for the evaluation of diagnostic tests when the disease space is dichotomous (eg, presence of disease vs absence of disease). In routine clinical practice, there are often more than two disease states (eg, different levels of disease severity), and the disease can have more than one dimension (eg, number and location of metastases). The optimal method for multiple-state diagnostic test evaluation remains undetermined.

Level 3, or "diagnostic thinking efficacy," is used to measure the effect of diagnostic test results on the thinking of physicians. Because it is so difficult to establish a connection between diagnostic test results and patient outcomes, measuring the effect of test results on the diagnostic thinking of clinicians might be a reasonable proxy for the effect of tests on outcomes. For example, suppose that a clinician is considering two equally likely diagnoses for a particular patient. Further suppose that the findings of an imaging test strongly favor one of the possible diagnoses. We would expect that the clinician would revise his diagnosis to reflect the test results and that he or she would consider the test (if correct) to have provided some benefit to the patient because adequate treatment can be provided.

The odds-likelihood ratio form of Bayes theorem provides a framework for evaluating the effect of diagnostic information on diagnostic thinking (72):

{r05ma34e01}
or

{r05ma34e02}
where P[D+] represents the probability disease (D) is present (prevalence), P[D] represents the probability that disease is not present, P[D+|R] represents the probability of disease given the test result (R), P[D|R] represents the probability of no disease given the test result, P[R|D+] represents the probability of result given the disease, and P[R|D] represents the probability of result given no disease.

The following equation was used to calculate the prior odds favoring (OF) disease being present:

{r05ma34e03}

The following equation was used to calculate the prior odds against (OA) disease being present:

{r05ma34e04}

Similarly, the following equation was used to calculate the posterior odds favoring disease being present:

{r05ma34e05}

Finally, the following equation was used to calculate the posterior odds against disease being present:

{r05ma34e06}

The likelihood ratio represents the ratio of the frequency of a certain test result in patients with disease to its frequency in patients without disease. The likelihood ratio can be used to judge the usefulness of a particular test in a given clinical situation (81,106108). Advocates of evidence-based medicine have also recommended the use of likelihood ratios in the evaluation of diagnostic technologies (109,110).

Level 4 is known as "therapeutic planning efficacy." The greatest efficacy at this level results from a test that might (correctly) lead to the initiation of a new therapy or the determination that therapy is not required. Studies concerned with level 4 efficacy are generally performed to compare intended patient care strategies prior to the test with intended patient care strategies after the test. Studies of the efficacy of body CT by Wittenberg and colleagues (32,33) illustrate this type of assessment. Level 4 efficacy questions are extremely challenging, however, since it is often difficult to determine what would have been done without the results of a diagnostic test once those results are available. Even if one were to query physicians prior to providing them with the test results, the situation is artificial; they know that the test has been performed, and neither they nor their patients will face any real risks on the basis of answers to these hypothetical questions.

Level 5, or "patient outcome efficacy," can really only be assessed in a prospective RCT, in which some patients undergo the test but others do not, and patient outcomes in the two groups (test vs no test) are compared. Unfortunately, RCTs of this sort are difficult to perform and are associated with challenging practical, analytic, and ethical issues. Imaging tests may have high sensitivity and specificity, cause important changes in clinicians’ diagnostic thinking, and even cause different therapies to be instituted. If these changes cannot be translated into improved patient outcomes, however, the value of the test—at least from the patients’ perspective—is questionable.

An additional challenge is to combine patient outcomes in different dimensions (eg, years of life gained, quality of life, preference for one test over another) into a single meaningful outcome measure (4648,111,112). For example, in some cases, the psychologic effects of diagnostic information, whether positive or negative, may be more important to patients than other effects that the test might have on outcomes. Examples are the reassurance value of a negative test (113) or the anxiety introduced by an imperfect or ambiguous test result. Determining how to relate these effects to overall quality of life remains to be investigated.

Level 6, or "societal efficacy," asks whether the benefit to society associated with the use of a test is acceptable in relation to its cost. In other words, is the test an efficient use of societal resources? In the current era of cost-conscious medical care, no major technology assessment effort should be considered complete without addressing cost-effectiveness considerations; however, these assessments are often the most challenging to perform. Level 6 assessments share all of the difficulties related to the assessment of efficacy of level 1–5 assessments, and they require a substantial amount of time and analytic resources to complete. A further complication is estimation of the cost of newly developed tests. Prices of new or experimental imaging technologies may not adequately represent their true cost in a competitive marketplace, and cost-based reimbursement rates often do not yet exist or are woefully inaccurate. Conducting a detailed analysis of each component of a particular test or procedure can begin to address these issues, but this analysis cannot possibly be performed for each new test as it becomes available. Finally, most new imaging technologies continue to undergo rapid technologic evolution in the years immediately after clinical implementation. Even if it were possible to estimate the costs and effects of each new test, it would be unreasonable to expect the estimates to be accurate for long. All too often, level 6 analyses cannot be accomplished before decisions regarding resource allocation must be made.

When deciding whether or not to commit resources to new imaging technologies, physicians have rarely considered costs and effects at a societal level. There are a number of reasons for this. To begin with, neither physicians nor patients directly bear the costs of diagnostic testing or therapeutic medical interventions. In addition, decisions are generally made in the context of individual patient encounters. In this context, it is difficult for a physician to withhold a test or treatment that may provide some benefit. It is even more difficult for the patient—who may be facing a life-threatening illness—to accept that a test or therapy is withheld because of concerns about optimizing resource allocation at a societal level.


    Decision Analysis and Diagnostic Technology Assessment
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
A decision analytic model (72,114118) can be used to integrate the data from all of these hierarchical levels of efficacy and evaluate final patient outcomes in relation to diagnostic test results, across a range of varying assumptions concerning disease prevalence, test performance, therapeutic efficacy, and costs. It is possible to model patient outcomes beyond the time horizons of short-term treatment studies. Sensitivity analysis can then be used to analyze the effect of varying assumptions on the conclusions reached and determine the conditions in which a particular test or combination of tests would be optimal.

Phelps and Mushlin (117) developed a strategy for evaluation of diagnostic technologies by using medical decision theory. The Phelps and Mushlin model presumes that there is some societal threshold, in dollars per quality-adjusted life-year, above which the investment of societal resources cannot be justified, and below which it is generally not questioned. Given ROC parameters for an existing procedure, a "challenge region" is computed and projected into ROC space. The ROC curve for a new imaging test must fall into the challenge region for its incremental cost-effectiveness to be less than the societal threshold.

The analysis proceeds in two steps. First, expected costs and patient outcomes with the new test (assuming perfect diagnostic accuracy) are compared with expected costs and outcomes in the absence of the additional diagnostic information that the new test might provide. If, by using these most optimistic assumptions, the ROC curve of the new test fails to enter the challenge region, it can be eliminated from further consideration (hurdle 1). If this first hurdle is met, clinical studies are undertaken to more precisely define the performance characteristics of the new test; the decision model can help identify critical information to be assessed. Next, by using the results of the clinical trials in combination with the decision models already developed, the incremental cost-effectiveness of the new test (relative to the existing alternative) is recalculated and compared to the (hypothetical) societal acceptability threshold (hurdle 2).

The principal advantage of the Phelps and Mushlin (117) analytic approach is that societal cost-effectiveness can be addressed by using relatively straightforward level 2, or diagnostic accuracy, efficacy data. By performing the assessments in two steps, tests that are "noncontenders" may be eliminated prior to determining sensitivity and specificity and constructing ROC curves. For those tests that pass the first hurdle (ie, their ROC curves fall into the challenge region under the assumption of perfect diagnostic accuracy), the decision models can help focus prospective data collection to ensure that necessary data are collected efficiently. The use of medical decision theory may permit more costly and time-consuming clinical trials to be either avoided altogether or streamlined substantially. If the decision to implement a new test would not vary over a range of possible values of some parameters, then expensive and time-consuming data collection may be avoided (78,79).


    Randomized Controlled Trials in Diagnostic Technology Assessment
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
Decision analytic methods necessarily involve the use of simplifying assumptions and may rely on inaccurate parameter estimates. Nonrandomized trials or retrospective data collection may produce overly optimistic estimates of benefit. RCTs are often required to confirm the results of a decision analysis and to be certain that confounding has not biased the results in favor or against the test being considered.

Designing and conducting an RCT to assess diagnostic imaging tests is less straightforward than studying therapies. In an RCT of a therapy, the treatment is generally withheld from some patients and provided to others. However, because most diagnostic tests have minimal or no adverse effects, and because many patients and physicians may believe that more information results in better care, it is often difficult to withhold a diagnostic test from patients enrolled in a trial. It might be possible to perform both tests in all patients and only randomize those patients with discordant test results; however, even that creates ethical dilemmas.

In an alternative approach (94), both tests are performed in all patients, but one of the tests is chosen at random as the test that will be used to determine therapy. Only the results of the randomly chosen test are provided to the physician(s) caring for the patient. By using this study design, it may also be possible to determine—in a separate setting—what care decisions would have been made if the results of a test that was not selected had been used. The advantage of this approach is that long-term follow-up may be omitted for patients with concordant (and in some cases, discordant) test results because the same care decisions would nevertheless have been made. This approach can only be used if the tests do not have complications that result in long-term consequences and if all of the possible effects of each test can be distinguished from one another and from the effects of therapy.

With the ever-increasing number of diagnostic tests available, it is often the case that several tests or combinations of tests are potentially useful. Comparing all of these tests in a clinical trial with sufficient statistical power to detect meaningful differences in outcome may require an extremely large trial and could take years to complete. Even if it were possible to initiate such a trial, the results may be confounded by treatment effects or invalidated by new therapies.


    Limitations of Technology Assessment
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
There is little question that technology assessment of medical technologies and practices is here to stay. The more critical questions are as follows: What can we hope to gain from increased efforts in this area? What will be the challenges that lie ahead?

Elhauge (119) has argued that technology assessment is important in our quest to better understand and improve the quality of medical care, but technology assessment is relatively limited in its ability to address the specific problem of cost escalation. He argues that if we are to constrain the rapid increase in health care spending, proper incentives to trade off costs and benefits are needed, rather than more technology assessments. He further suggests that, if anything, the move to a more cost-sensitive means of health care financing is likely to decrease the use of technology assessment and restrict the development and clinical implementation of expensive new therapies. A truly cost-sensitive financing system, he argues, will encourage providers to avoid overly expensive technologies and discourage researchers and manufacturers from developing them. Medical care in the United States is provided in an absolutist environment, however, and both professional and legal standards rarely permit an individual physician or payer to withhold potentially beneficial treatment from patients. Providers certainly do have appropriate incentives to avoid treatments with no benefit (and especially those that might harm their patients); however, the requirements for technology assessment in this setting are more modest (ie, to provide information regarding efficacy alone). The use of technology assessment in a broader role is unlikely because regulators lack the expertise to actually weigh benefits and costs and because the technical complexity of medical technology assessments, as well as the continuous evolution of medical technologies, makes their use in actual practice difficult.

Despite the valid concerns, it seems clear that more careful consideration of the costs and benefits of medical technology and interventions will be required as we continue to address the challenges facing modern medicine.


    Essentials
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 
Technology assessment includes evaluation of any or all of the following: feasibility, safety, efficacy, appropriateness, cost, or cost-effectiveness.

Cost-effectiveness analysis is a method for evaluating the health outcomes and costs of different medical technologies and procedures relative to one another.

Evaluation of diagnostic imaging technologies differs in many important ways because of unique features of diagnostic imaging.


    FOOTNOTES
 
Authors stated no financial relationship to disclose.

Abbreviations: CEA = cost-effectiveness analysis, NCHCT = National Center for Health Care Technology, RCT = randomized controlled trial, ROC = receiver operating characteristic


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 Brief Review of American...
 CEA in Medical Technology...
 The Role of Decision...
 Unique Features of Diagnostic...
 A Hierarchical Approach to...
 Decision Analysis and Diagnostic...
 Randomized Controlled Trials in...
 Limitations of Technology...
 Essentials
 REFERENCES
 

  1. Pashos CL, Newhouse JP, McNeil BJ. Temporal changes in the care and outcomes of elderly patients with acute myocardial infarction, 1987 through 1990. JAMA 1993; 270:1832-1836.[Abstract/Free Full Text]
  2. Maitino A, Levin D, Parker L, Rao V, Sunshine J. Nationwide trends in rates of utilization of noninvasive diagnostic imaging among the Medicare population between 1993 and 1999. Radiology 2003; 227:113-117.[Abstract/Free Full Text]
  3. Fuchs VR. Health care for the elderly: how much? who will pay for it? Health Aff (Millwood) 1999; 18:11-21.[Abstract]
  4. Miettinen O, Henschke C. CT screening for lung cancer: coping with nihilistic recommendations. Radiology 2001; 221:592-596.[Abstract/Free Full Text]
  5. Miettinen O, Henschke C. Commentary on Drs. Patz, Black, and Goodman’s viewpoint. Radiology 2001; 221:598-599.
  6. Patz E, Goodman P, Bepler G. Screening for lung cancer. N Engl J Med 2000; 343:1627-1633.[Free Full Text]
  7. Patz E, Black W, Goodman P. CT screening for lung cancer: not ready for routine practice. Radiology 2001; 221:587-591.[Abstract/Free Full Text]
  8. Patz E, Black W, Goodman P. Commentary on Drs. Miettinen and Henschke’s viewpoint. Radiology 2001; 221:597.
  9. Gøtzsche P, Olsen O. Is screening for breast cancer with mammography justifiable? Lancet 2000; 355:129-134.[CrossRef][Medline]
  10. Sox H. Screening mammography for younger women: back to basics. Ann Intern Med 2002; 137:361-362.[Free Full Text]
  11. Woloshin S, Schwartz LM, Byram SJ, Sox HC, Fischhoff B, Welch HG. Women’s understanding of the mammography screening debate. Arch Intern Med 2000; 160:1434-1440.[Abstract/Free Full Text]
  12. Ransohoff DF, Harris RP. Lessons from the mammography screening controversy: can we improve the debate? Ann Intern Med 1997; 127:1029-1034.[Abstract/Free Full Text]
  13. Altman SH, Tompkins CP, Eilat E, Glavin MP. Escalating health care spending: is it desirable or inevitable? Health Aff (Millwood) 2003; Suppl Web Exclusives:W3–1-14.
  14. Levit K, Smith C, Cowan C, Sensenig A, Catlin A, for the Health Accounts Team. Health spending rebound continues in 2002. Health Aff (Millwood) 2004; 23:147-159.[Abstract/Free Full Text]
  15. Relman AS. The third revolution in medical care. N Engl J Med 1988; 319:1220-1222.[Medline]
  16. Roper WL, Hackbarth GM. HCFA’s agenda for promoting high-quality care. Health Aff (Millwood) 1988; 7:91-98.[CrossRef][Medline]
  17. Roper WL, Winkenwerder W, Hackbarth GM, Krakauer H. Effectiveness in health care: an initiative to evaluate and improve medical practice. N Engl J Med 1988; 319:1197-1202.[Medline]
  18. Guadagnoli E, McNeil BJ. Outcomes research: hope for the future or the latest rage? Inquiry 1994; 31:14-24.[Medline]
  19. Fineberg HV, Hiatt HH. The case for technology assessment. N Engl J Med 1979; 301:1086-1091.[Abstract]
  20. Brook RH, Lohr KN. Efficacy, effectiveness, variations, and quality. Med Care 1985; 23:710-721.[Medline]
  21. Fuchs VR, Garber AM. The new technology assessment. N Engl J Med 1990; 323:673-677.[Medline]
  22. DiMasi JA, Hansen RW, Grabowski HG. The price of innovation: new estimates of drug development costs. J Health Econ 2003; 22:151-185.[CrossRef][Medline]
  23. House Committee on Science and Astronautics. Technology: processes of assessment and choice—report of the National Academy of Sciences. July 1969.
  24. House Committee on Science and Astronautics. A study of technology assessment: report of the Committee on Public Engineering Policy, National Academy of Engineering. July 1969.
  25. House Committee on Science and Astronautics. A technology assessment system for the executive branch: report of the National Academy of Public Administration. July 1970.
  26. Technology Assessment Act of 1972, Public L No. 92–484 1972.
  27. Rettig RA. Technology assessment—an update. Invest Radiol 1991; 26:165-173.[CrossRef][Medline]
  28. Larson EB, Omenn GS. The impact of computed tomography on the care of patients with suspected brain tumor. Med Care 1977; 15:543-551.[CrossRef][Medline]
  29. Creditor MC, Garrett JB. The information base for diffusion of technology: computed tomography scanning. N Engl J Med 1977; 297:49-52.[Medline]
  30. Abrams HL, McNeil BJ. Medical implications of computed tomography ("CAT scanning") (first of two parts). N Engl J Med 1978; 298:255-261.[Medline]
  31. Abrams HL, McNeil BJ. Medical implications of computed tomography ("CAT scanning") (second of two parts). N Engl J Med 1978; 298:310-318.[Medline]
  32. Wittenberg J, Fineberg HV, Black EB, et al. Clinical efficacy of computed body tomography. AJR Am J Roentgenol 1978; 131:5-14.[Abstract]
  33. Wittenberg J, Fineberg HV, Ferrucci JT, et al. Clinical efficacy of body computed tomography. II. AJR Am J Roentgenol 1980; 134:1111-1120.
  34. Fineberg HV. Evaluation of computed tomography: achievement and challenge. AJR Am J Roentgenol 1978; 131:1-4.[Medline]
  35. Fineberg HV, Bauman R, Sosman M. Computerized cranial tomography: effect on diagnostic and therapeutic plans. JAMA 1977; 238:224-227.[Abstract/Free Full Text]
  36. A Consortium for Assessing Medical Technology. A planning study report by a committee of the Institute of Medicine, Division of Health Sciences Policy Washington, DC: National Academy Press, 1983.
  37. McNeil B, Collins JJ, Adelstein SJ. Rationale for seeking occult metastases in patient with bronchial carcinoma. Surg Gynecol Obstet 1977; 144:389-393.[Medline]
  38. McNeil BJ, Varady PD, Burrows BA, Adelstein SJ. Measures of clinical efficacy: cost-effectiveness calculations in the diagnosis and treatment of hypertensive renovascular disease. N Engl J Med 1975; 293:216-221.[Abstract]
  39. Brook RH, Chassin MR, Fink A, Solomon DH, Kosekoff J, Park RE. A method for the detailed assessment of the appropriateness of medical technologies. Int J Technol Assess Health Care 1986; 2:53-63.[Medline]
  40. Wennberg J, Gittelsohn AM. Small area variations in health care delivery. Science 1973; 182:1102-1108.[Abstract/Free Full Text]
  41. Wennberg JE, Freeman JL, Culp WJ. Are hospital services rationed in New Haven or over-utilised in Boston? Lancet 1987; 1:1185-1189.[CrossRef][Medline]
  42. Fowler FJ, Jr, Wennberg JE, Timothy RP, Barry MJ, Mulley AG, Jr, Hanley D. Symptom status and quality of life following prostatectomy. JAMA 1988; 259:3018-3022.[Abstract/Free Full Text]
  43. Abrams HL, McNeil BJ. Computed tomography: cost and efficacy implications. AJR Am J Roentgenol 1978; 131:81-87.[Abstract]
  44. Blue Cross and Blue Shield Association. Technology Assessment Criteria
  45. U.S. Congress OoTA. Unconventional Cancer Treatments, OTA-H-405 Washington, DC: U.S. Government Printing Office, 1990.
  46. Torrance GW. Measurement of health state utilities for economic appraisal: a review. J Health Econ 1986; 5:1-3.[CrossRef][Medline]
  47. Patrick DL, Deyo RA. Generic and disease-specific measures in assessing health status and quality of life. Med Care 1989; 27(suppl 3):S217-S232.[Medline]
  48. Bergner M. Quality of life, health status, and clinical research. Med Care 1989; 27(suppl 3):S148-S156.[CrossRef][Medline]
  49. Pliskin JS, Shepard DS, Weinstein MC. Utility functions for life years and health status. Operations Res 1980; 28:206-224.
  50. Mehrez A, Gafni A. Quality-adjusted life years, utility theory, and healthy-years equivalents. Med Decis Making 1989; 9:142-149.
  51. Barry MJ, Mulley AG, Fowler JF, Wennberg JE. Watchful waiting vs immediate transuretheral resection for symptomatic prostatism: the importance of patients’ preferences. JAMA 1988; 259:3010-3017.[Abstract/Free Full Text]
  52. Read JL, Quinn RJ, Berwick DM, Fineberg HV, Weinstein MC. Preferences for health outcomes: comparisons of assessment methods. Med Decis Making 1984; 4:315-329.
  53. Russell LB, Gold MB, Siegel JE, Daniels N, Weinstein MC. The role of cost-effectiveness analysis in health and medicine. JAMA 1996; 276:1172-1177.[Abstract/Free Full Text]
  54. Siegel JE, Weinstein MC, Russell LB, Gold MR. Recommendations for reporting cost-effectiveness analyses. JAMA 1996; 276:1339-1341.[Abstract/Free Full Text]
  55. Gold MR, Siegel JE, Russel LB, Weinstein MC. Cost-effectiveness in health and medicine New York, NY: Oxford University Press, 1996.
  56. Siegel JE, Torrance GW, Russell LB, Luce BR, Weinstein MC, Gold MR. Guidelines for parmacoeconomic studies: recommendations from the panel on cost-effectiveness in health and medicine. Pharmacoeconomics 1997; 11:159-168.[Medline]
  57. Weinstein MC, Siegel JE, Gold MR, Kamlet MS, Russell LB. Recommendations of the panel on cost-effectiveness in health and medicine. JAMA 1996; 276:1253-1258.[Abstract/Free Full Text]
  58. Heffler S, Smith S, Keehan S, Clemens MK, Won G, Zezza M. Health spending projections for 2002–2012. Health Aff (Millwood) 2003. Suppl Web Exclusives:W3–54-65.
  59. Weinstein MC, Stason WB. Foundations of cost-effectiveness analysis for health and medical practices. N Engl J Med 1977; 296:716-721.[Abstract]
  60. Neumann PJ, Hermann RC, Weinstein MC. Measuring QALYs in dementia. In: Wimo A, Jonsson B, Karlsson G, Winblad B, eds. Health economics of dementia. New York, NY: Wiley, 1998; 359-370.
  61. Tsevat J, Weeks JC, Guadagnoli E, et al. Health-related quality of life assessment: a framework for clinical encounters, clinical trials, and health policy. J Gen Intern Med 1994; 9:576-582.[Medline]
  62. Carr-Hill RA. Assumptions of the QALY procedure. Soc Sci Med 1989; 29:469-477.
  63. Johannesson M, Pliskin JS, Weinstein MC. A note on QALYs, time tradeoff, and discounting. Med Decis Making 1994; 14:188-193.
  64. Krahn M, Gafni A. Discounting in the economic evaluation of health care interventions. Med Care 1993; 31:403-418.[Medline]
  65. Horowitz JK, Carson RT. Discounting statistical lives. J Risk Uncertainty 1990; 3:403-413.[CrossRef]
  66. Parsonage M, Neuburger H. Discounting and health benefits. Health Econ 1992; 1:71-76.[Medline]
  67. Keeler EB, Cretin S. Discounting of life-saving and other nonmonetary effects. Management Sci 1983; 29:300-306.
  68. Drummond MF, O’Brien B, Stoddart GL, Torrance GW. Methods for the economic evaluation of health care programmes New York, NY: Oxford University Press, 1997.
  69. Zeckhauser R. Procedures for valuing lives. Public Policy 1975; 23:419-464.[Medline]
  70. Raiffa H. Decision analysis: introductory lectures on choices under uncertainty Reading, Mass: Addison-Wesley, 1968.
  71. Schwartz WB, Gorry GA, Kassirer JP, Essig A. Decision analysis and clinical judgment. Am J Med 1973; 55:459-472.[CrossRef][Medline]
  72. Weinstein MC, Fineberg HV, Elstein AS, et al. Clinical decision analysis Philadelphia, Pa: Saunders, 1980.
  73. Halpern EF, Weinstein MC, Hunink MGM, Gazelle GS. Representing both first- and second-order uncertainties by Monte Carlo simulation for groups of patients. Med Decis Making 2000; 20:314-322.[Abstract/Free Full Text]
  74. Doubilet P, Begg CB, Weinstein MC, Braun P, McNeil BJ. Probabilistic sensitivity analysis using Monte Carlo simulation: a practical approach. Med Decis Making 1985; 5:157-177.
  75. O’Brien BJ, Drummond MF, Labelle RJ, Willan A. In search of power and significance: issues in the design and analysis of stochastic cost-effectiveness studies in health care. Med Care 1994; 32:150-163.[CrossRef][Medline]
  76. Tengs TO, Adams ME, Pliskin JS, et al. Five-hundred life-saving interventions and their cost-effectiveness. Risk Anal 1995; 15:369-390.[CrossRef][Medline]
  77. Siebert U. When should decision-analytic modeling be used in the economic evaluation of health care? (editorial). Eur J Health Econ 2003; 4:143-150.
  78. Chilcott J, Brennan A, Booth A, Karnon J, Tappenden P. The role of modelling in prioritising and planning clinical trials. Health Technol Assess 2003; 7(23):iii, 1-125.
  79. Townsend J, Buxton M, Harper G. Prioritisation of health technology assessment: the PATHS model—methods and case studies. Health Technol Assess 2003; 7(20):iii, 1-82.
  80. Ledley RS, Lusted LB. Reasoning foundations of medical diagnosis. Science 1959; 130:9-21.[Free Full Text]
  81. Thornbury JR, Fryback DG, Ward E. Likelihood ratios as a measure of the diagnostic usefulness of excretory urogram information. Radiology 1975; 114:561-565.[Abstract]
  82. McNeil BJ, Adelstein SJ. Determining the value of diagnostic and screening tests. J Nucl Med 1976; 17:439-448.[Free Full Text]
  83. Visser K, Kock MC, Kuntz KM, Donaldson MC, Gazelle GS, Hunink MG. Cost-effectiveness targets for multi-detector row CT angiography in the work-up of patients with intermittent claudication. Radiology 2003; 227:647-656.[Abstract/Free Full Text]
  84. Visser K, Idu MM, Buth J, Engel GL, Hunink MG. Duplex scan surveillance during the first year after infrainguinal autologous vein bypass grafting surgery: costs and clinical outcomes compared with other surveillance programs. J Vasc Surg 2001; 33:123-130.[CrossRef][Medline]
  85. Visser K, Hunink MG. Peripheral arterial disease: gadolinium-enhanced MR angiography versus color-guided duplex US—a meta-analysis. Radiology 2000; 216:67-77.[Abstract/Free Full Text]
  86. Hunink MG, Kuntz KM, Fleischmann KE, Brady TJ. Noninvasive imaging for the diagnosis of coronary artery disease: focusing the development of new diagnostic technology. Ann Intern Med 1999; 131:673-680.[Abstract/Free Full Text]
  87. Jarvik JG, Hollingworth W, Martin B, et al. Rapid magnetic resonance imaging vs radiographs for patients with low back pain: a randomized controlled trial. JAMA 2003; 289:2810-2818.[Abstract/Free Full Text]
  88. McMahon P, Araki S, Sandberg E, Neumann P, Gazelle G. Cost-effectiveness of PET in the diagnosis of Alzheimer disease. Radiology 2003; 228:515-522.[Abstract/Free Full Text]
  89. McMahon P, Araki S, Neumann P, Harris G, Gazelle G. Cost-effectiveness of functional imaging tests in the diagnosis of Alzheimer disease. Radiology 2000; 217:58-68.[Abstract/Free Full Text]
  90. McMahon P, Halpern E, Fernandez-del Castillo C, Clark J, Gazelle G. Pancreatic cancer: cost-effectiveness of imaging technologies for assessing resectability. Radiology 2001; 221:93-106.[Abstract/Free Full Text]
  91. Fryback DG. A conceptual model for output measures in cost-effectiveness evaluation of diagnostic imaging. J Neuroradiol 1983; 10:94-96.[Medline]
  92. Fryback DG, Thornbury JR. The efficacy of diagnostic imaging. Med Decis Making 1991; 11:88-94.
  93. Dixon AK. Evidence-based diagnostic radiology. Lancet 1997; 350:509-512.[CrossRef][Medline]
  94. Hunink MG. Outcomes research and cost-effectiveness analysis in radiology. Eur J Radiol 1998; 27:85-87.[CrossRef][Medline]
  95. Begg CB, Greenes RA, Iglewicz B. The influence of uninterpretability on the assessment of diagnostic tests. J Chronic Dis 1986; 39:575-584.[CrossRef][Medline]
  96. Begg CB, Greenes RA. Assessment of diagnostic tests when disease verification is subject to selection bias. Biometrics 1983; 39:207-215.[CrossRef][Medline]
  97. Begg CB, McNeil BJ. Assessment of radiologic tests: control of bias and other design considerations. Radiology 1988; 167:565-569.[Abstract/Free Full Text]
  98. Hunink MG, Richardson DK, Doubilet PM, Begg CB. Testing for fetal pulminary maturity: ROC analysis involving covariates, verification bias, and combination testing. Med Decis Making 1990; 10:201-211.
  99. Metz CE. Some practical issues of experimental design and data analysis in radiological ROC studies. Invest Radiol 1989; 24:234-245.[Medline]
  100. Hanley JA, McNeil BJ. A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 1983; 148:839-843.[Abstract/Free Full Text]
  101. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982; 143:29-36.[Abstract/Free Full Text]
  102. Metz CE. ROC methodology in radiologic imaging. Invest Radiol 1986; 21:720-733.[Medline]
  103. Tosteson AN, Begg CB. A general regression methodology for ROC curve estimation. Med Decis Making 1988; 8:204-215.
  104. Littenberg B, Moses LE. Estimating diagnostic accuracy from multiple conflicting reports: a new meta-analytic method. Med Decis Making 1993; 13:313-321.
  105. Walter S. Properties of the summary receiver operating characteristic (SROC) curve for diagnostic test data. Stat Med 2002; 21:1237-1256.[CrossRef][Medline]
  106. Stengel D, Bauwens K, Sehouli J, et al. Systematic review and meta-analysis of emergency ultrasonography for blunt abdominal trauma. Br J Surg 2001; 88:901-912.[CrossRef][Medline]
  107. Kealey SM, Dodd JD, MacEneaney PM, Gibney RG, Malone DE. Minimal preparation computed tomography instead of barium enema/colonoscopy for suspected colon cancer in frail elderly patients: an outcome analysis study. Clin Radiol 2004; 59:44-52.[CrossRef][Medline]
  108. Worster A, Preyra I, Weaver B, Haines T. The accuracy of noncontrast helical computed tomography versus intravenous pyelography in the diagnosis of suspected acute urolithiasis: a meta-analysis. Ann Emerg Med 2002; 40:280-286.[CrossRef][Medline]
  109. Jaeschke R, Guyatt GH, Sackett DL. Users’ guides to the medical literature. III. How to use an article about a diagnostic test. B. What are the results and will they help me in caring for my patients? The Evidence-Based Medicine Working Group. JAMA 1994; 271:703-707.
  110. Jaeschke R, Guyatt G, Sackett DL. Users’ guides to the medical literature. III. How to use an article about a diagnostic test. A. Are the results of the study valid? Evidence-Based Medicine Working Group. JAMA 1994; 271:389-391.
  111. Rosser R, Kind P. A scale of valuations of states of illness: is there a social consensus? Int J Epidemiol 1978; 7:347-358.[Abstract/Free Full Text]
  112. Kaplan RM, Anderson JP. A general health policy model: update and applications. Health Serv Res 1988; 23:203-235.[Medline]
  113. Gorry GA, Pauker SG, Schwartz WB. The diagnostic importance of the normal finding. N Engl J Med 1978; 298:486-489.[Abstract]
  114. Syngal S, Weeks JC, Schrag D, Garber JE, Kuntz KM. Benefits of colonoscopic surveillance and prophylactic colectomy in patients with hereditary nonpolyposis colectal cancer mutations. Ann Intern Med 1998; 129:787-796.[Abstract/Free Full Text]
  115. Kent KC, Kuntz KM, Patel MR, et al. Imaging strategies for carotid endarterectomy: an analysis of morbidity and cost-effectiveness. JAMA 1995; 274:888-893.[Abstract/Free Full Text]
  116. Chang P, Friedland GW. The role of imaging in screening for prostate cancer: a decision analysis perspective. Invest Radiol 1990; 25:591-595.[Medline]
  117. Phelps CE, Mushlin AI. Focusing technology assessment using medical decision theory. Med Decis Making 1988; 8:279-289.
  118. Eddy DM. Screening for colorectal cancer. Ann Intern Med 1990; 113:373-384.
  119. Elhauge E. The limited regulatory potential of medical technology assessment. Virginia Law Review 1996; 82:1525- 1622.



This article has been cited by other articles:


Home page
JNMHome page
R. Chisin
Cost-Effectiveness Analysis
J. Nucl. Med., March 1, 2009; 50(3): 338 - 339.
[Full Text] [PDF]


Home page
Health Aff (Millwood)Home page
S. D. Pearson, A. B. Knudsen, R. W. Scherer, J. Weissberg, and G. S. Gazelle
Assessing The Comparative Effectiveness Of A Diagnostic Technology: CT Colonography
Health Aff., November 1, 2008; 27(6): 1503 - 1514.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
C. S. Poon
A Brief Commentary on Cost-Effectiveness Analysis in Radiologic Research
Am. J. Roentgenol., November 1, 2008; 191(5): 1320 - 1322.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
J. R. Petrella, L. M. Shah, K. M. Harris, A. H. Friedman, T. M. George, J. H. Sampson, J. S. Pekala, and J. T. Voyvodic
Preoperative Functional MR Imaging Localization of Language and Motor Areas: Effect on Therapeutic Decision Making in Patients with Potentially Resectable Brain Tumors
Radiology, September 1, 2006; 240(3): 793 - 802.
[Abstract] [Full Text] [PDF]


Home page
Arch SurgHome page
J. R. Rodriguez, S. S. Germes, P. V. Pandharipande, G. S. Gazelle, S. P. Thayer, A. L. Warshaw, and C. Fernandez-del Castillo
Implications and Cost of Pancreatic Leak Following Distal Pancreatic Resection
Arch Surg, April 1, 2006; 141(4): 361 - 366.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Gazelle, G. S.
Right arrow Articles by Beinfeld, M. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Gazelle, G. S.
Right arrow Articles by Beinfeld, M. T.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
RADIOLOGY RADIOGRAPHICS RSNA JOURNALS ONLINE