|
|
||||||||
Editorials |
1 From the Earl A. Chiles Research Institute, Portland, Ore. Received April 21, 2005; revision requested June 14; revision received June 23; final version accepted August 15. Address correspondence to the author, 7400 SW Barnes Rd, A622, Portland, OR 97225 (e-mail: Reichje{at}dnamail.com).
Lung cancer would seem, a priori, to be an ideal screening candidate: It is the most common lethal neoplasm in the United States; it is often detectable in a preclinical phase; radiographs are noninvasive and readily acceptable; individuals at highest riskolder current and former smokersare readily identifiable; when it is detected at an early stage, the "5-year cure" rate is approximately 70%; and it is highly lethal when unresectable (1). Although survival is substantially higher in patients with screening-identified disease than in patients in whom disease was identified by other means, screening is not recommended by recognized authorities because results of two large randomized controlled trials demonstrated that although 5-year survival was far higher in the screened than in the unscreened cohort, lung cancer and all-cause mortality were higher as well (1). This controversy has been reignited by the development of low-dose computed tomography (CT), a far more sensitive method than conventional radiography for identifying resectable lung cancers (1). Screening advocates assert that improvement in lung cancer survival, predicated on stage at diagnosis, suffices to demonstrate efficacy (2,3). Others have pointed out that cause-specific survival is an invalid metric of screening efficacy because it discounts other causes of mortality and it introduces biaseslead-time bias, length-biased sampling, and overdiagnosisthat generate a spurious improvement (4). The purpose of this communication is to delineate the limitations of outcome metrics used to assess lung cancer screening efficacy. Definitive evidence of efficacy requires the demonstration of a reduction in lung cancer and all-cause mortality in prospective controlled trials.
| BIASES INTRODUCED BY SCREENING |
|---|
|
|
|---|
Lead-time bias refers to earlier detection of disease, in which intervention, as long as it is not harmful, will lengthen survival without necessarily improving longevity. Consider a hypothetical screening test for motor neuron dysfunction capable of advancing by 5 years the diagnosis of amyotrophic lateral sclerosis, a highly lethal and untreatable condition. Theoretically, if screening were widely applied, it would, due solely to a shift of time frame, increase 5-year survival from near zero to virtually 100%, while mortality, after the onset of clinical disease, would be unchanged.
Length-biased sampling refers to the preferential screening identification of a more phenotypically favorable population of lung cancers because they are slower growing and therefore have a lengthier preclinical detectable phase. Aggressive lung cancers are less likely to be screening identified because of their briefer preclinical detectable phase. Reflecting this bias, survival for individuals with screening-identified lung cancers, particularly lung cancers identified at prevalence screening examinations, is predictably higher than in the overall population of individuals with lung cancer.
Overdiagnosis (or "pseudodisease") refers to the condition in which screening-identified cancers would not have shortened an individual's longevity (6). Overdiagnosis occurs in two circumstances: (a) Some cancers (eg, some prostate and thyroid cancers, which are frequent, incidental autopsy findings) have such limited biologic potential that they prove to be nonlethal and (b) the course of potentially lethal cancers may be overtaken by competing lethal morbidities. For example, lung cancer frequently occurs in persons with other smoking-related disorderschronic obstructive lung disease and coronary artery disease.
These biases, which are invariably entrained by screening, are interrelated: A protracted lead-time bias increases the likelihood of an intervening lethal comorbidity (resulting in overdiagnosis), and overdiagnosis bias may be considered the extreme example of length-biased sampling, in which the preclinical detectable phase of the neoplasm is so protracted that its course is overtaken by that of a lethal comorbidity.
The influence of overdiagnosis on screening outcomes is critically important: Overdiagnosis imparts a spurious improvement in outcome as measured by survival, and, while both lead-timebiased and length-biased sampling generate a spurious outcome improvement, it is reasonable to assume that some individuals otherwise destined to die of lung cancer benefit from earlier surgical intervention. In contrast, overdiagnosed individuals cannot benefit from earlier intervention because they are destined to die of causes other than lung cancer. Moreover, invasive diagnostic procedures and surgical intervention may shorten their life expectancy. Also, although lead-time bias may be roughly gauged by measurement of growth rates and length-biased sampling may be estimated by the proportion of interval cancers (ie, those that arise between screenings) in one-arm (ie, uncontrolled) investigations, it is impossible to quantify overdiagnosis in such investigations. The magnitude of overdiagnosis and its assessment remain highly controversial and complex subjects that are beyond the scope of this editorial.
| MEASUREMENTS OF SCREENING OUTCOME |
|---|
|
|
|---|
Lung cancer mortality, because it reflects death only from lung cancer, has three limitations: (a) Correctly assigning the cause of death may be impossible in some persons with lung cancer. A careful review of medical records, as was undertaken in the Mayo Lung Program (see below), will more accurately ascertain whether progressive lung cancer was the cause of death than will reference to death certificates (8). However, consider an individual with obstructive airway disease who at surgery is found to have bilateral micronodal metastases. After resection, the patient develops an air leak that necessitates prolonged chest tube drainage, which induces pleural fibrosis; the patient succumbs to respiratory failure months later. Would this event be consistently or correctly classified as a lung cancer death? (b) Lung cancer mortality fails to reflect fatalities caused by lethal comorbidities (which are frequently present in individuals with lung cancer), the courses of which are likely to be foreshortened by invasive diagnostic or therapeutic interventions. (c) Lung cancer mortality omits deaths in the screened cohort caused by intervention after false-positive test resultsfor example, a lethal myocardial infarction that occurs because of an invasive diagnostic or therapeutic intervention for a benign nodule. Black et al (9) coined the term slippery linkage to describe the increased risk of death that is not captured by cause-specific survival or cause-specific mortality statistics. All-cause mortality circumvents these limitations. Were it not for the disadvantage that all-cause mortality requires impractically large samples (because of the dilution of the effect of organ-specific screening by other causes of mortality that it cannot influence), it would be the reference standard for appraising screening efficacy. The usefulness of all-cause mortality lies in augmenting cause-specific mortality, where it may signal unrecognized or underestimated adverse effects of screening and intervention that are not reflected by cause-specific mortality. The reader will find a comprehensive discussion of the importance of all-cause mortality in the article by Black et al (9).
| LUNG CANCER SURVIVAL VERSUS ALL-CAUSE SURVIVAL |
|---|
|
|
|---|
3 cm in size and are surrounded by lung or visceral pleura, with bronchoscopic evidence of spread no more proximal than the lobar bronchus. N0 signifies no hilar or mediastinal metastases; M0 indicates no discernible extranodal metastases.) A cohort of 100 patients with stage IA lung cancer undergo lobectomy; their projected 5-year lung cancer survival rate is 70%. If 50 of the patients died of surgically foreshortened comorbidities at 3 years, while eight lung cancer deaths occurred at 1 year and seven occurred at 3 years, the Kaplan-Meier estimate of 5-year lung cancer survival would be 78%; inclusion of the censored data would result in an all-cause survival rate of 35%. | OUTCOME OF CONTROLLED TRIALS OF LUNG CANCER SCREENING |
|---|
|
|
|---|
Kubïk et al (13) undertook a similar study in Czechoslovakia. After a prevalence screening examination, 6346 participants were randomly allocated to a cohort that underwent semiannual radiographic and cytologic screening or to an unscreened control cohort. At 3 years and annually thereafter for 3 years, both cohorts were radiographically screened. Twenty-six more lung cancers (108 vs 82) were identified in the initially screened cohort than in the control cohort. The majority of excess cases were identified during the initial 3-year screening. The case excess in the screened cohort was therefore (108 82) ÷ 108 = 24%, a value close to the 22% in the Mayo Lung Project. Moreover, in the Czech study as in the Mayo Lung Project, 5-year lung cancer survival markedly favored the screened cohort. Although lung cancer was identified at an earlier stage, lung cancer deaths were higher in the initially screened cohort (n = 45) than in the control cohort (n = 40). After an additional 3 years of follow-up limited to patients with lung cancer, there were 85 lung cancer deaths in the initially screened cohort versus 67 such deaths in the control cohort. All-cause mortality was substantially higher in the initially screened cohort than in the control cohort17.9 versus 15.4 deaths per 1000 person-years (9).
Manser et al (14) pooled the updated Mayo (11) and Czech (13) study results and found the risk of death from lung cancer to be 11% higher in the intervention groups, with a relative risk (RR) of 1.11 (95% confidence interval [CI]: 1.00, 1.23) and a P value of .05. For all-cause mortality, they reported an intervention cohort RR of 1.16 (95% CI: 1.00, 1.35) in the Czech study and 1.03 (95% CI: 0.93, 1.14) in the Mayo study.
Screening, if it is to be effective, must intercept the growth of cancer at an early stage, while it remains resectable. A reduction in the absolute number (ie, not the proportion) of advanced (unresectable) cancers, referred to as "stage shift," is the fundamental underlying premise of screening. Neither the Mayo nor the Czech study found a reduction in the number of advanced lung cancers in the screened versus the control groups. Both the Mayo and the Czech studies revealed an excess number of lung cancers and an excess of lung cancer and all-cause mortality in the intervention cohort, although the differences were greater in the Czech study. Black et al (9) emphasized the disparity between lung cancer and all-cause mortality in both studies (values are deaths per 10 000 person-years): In the extended follow-up of the Mayo trial, lung cancer mortality in the intervention cohort exceeded that in the control cohort by 4.4, while all-cause mortality was higher by 6.3. In the Czech trial, lung cancer mortality in the intervention cohort exceeded that in the unscreened cohort by 8.9, while all-cause mortality was higher by 25.3.
| NET BENEFIT OF LUNG CANCER SCREENING |
|---|
|
|
|---|
| IMPACT OF LOW-DOSE CT SCREENING |
|---|
|
|
|---|
| SUMMARY AND CONCLUSIONS |
|---|
|
|
|---|
In conclusion, use of the lung cancer mortality metric circumvents the three biases introduced by screening, but, unlike all-cause mortality, this metric fails to fully capture the adverse consequences of invasive procedures, including resection.
To achieve an improvement in lung cancer outcome, low-dose CT, which will predictably increase the proportion of overdiagnosed lung cancer, would have to achieve a reduction in lung cancer mortality sufficient to offset this incremental adverse effect.
The efficacy of low-dose CT screening can be ascertained only by a comparison of lung cancer mortalityaugmented by all-cause mortalityin both arms of a large randomized prospective controlled trial.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
See also the editorial by Gur in this issue.
| References |
|---|
|
|
|---|
Related Article
This article has been cited by other articles:
![]() |
J M Reich A critical appraisal of overdiagnosis: estimates of its magnitude and implications for lung cancer screening Thorax, April 1, 2008; 63(4): 377 - 383. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Wilck Computed Tomography Screening for Lung Cancer Ann. Thorac. Surg., February 1, 2008; 85(2): S699 - S700. [Full Text] [PDF] |
||||
![]() |
J. Reich, J. Asaph, D. J. Raz, and D. M. Jablons Natural History of Stage I Lung Cancer Chest, December 1, 2007; 132(6): 2062 - 2063. [Full Text] [PDF] |
||||
![]() |
W. C. Black, D. R. Aberle, C. D. Berg, and For the Executive Committee of the National Lung S Large Field Trial for Lung Cancer Screening: Putting the Wrong Cart before the Horse? Radiology, May 1, 2007; 243(2): 314 - 316. [Full Text] [PDF] |
||||
![]() |
H. Schoder and M. Gonen Screening for Cancer with PET and PET/CT: Potential and Limitations J. Nucl. Med., January 1, 2007; 48(1_suppl): 4S - 18S. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |