Radiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


DOI: 10.1148/radiol.2342040026
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Chapman, B. E.
Right arrow Articles by Gur, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Chapman, B. E.
Right arrow Articles by Gur, D.
(Radiology 2005;234:582-590.)
© RSNA, 2005


Thoracic Imaging

Lung Cancer Screening: Simulations of Effects of Imperfect Detection on Temporal Dynamics1

Brian E. Chapman, PhD, David F. Yankelevitz, MD, Claudia I. Henschke, MD, PhD and David Gur, ScD

1 From the Department of Radiology, University of Pittsburgh, Imaging Research, Suite 4200, 300 Halket St, Pittsburgh, PA 15213 (B.E.C., D.G.); and Department of Radiology, Weill Medical College of Cornell University, the New York Hospital-Cornell Medical Center, New York, NY (D.F.Y., C.I.H.). Received January 9, 2004; revision requested March 10; revision received March 29; accepted April 28. Address correspondence to B.E.C. (e-mail: chapmanbe@upmc.edu).


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 APPENDIX
 REFERENCES
 
PURPOSE: To use a mathematic model to demonstrate effects of imperfect detection on temporal dynamics of radiologic lung cancer screening.

MATERIALS AND METHODS: Monte Carlo simulations of lung cancer screening programs were performed in subjects at high risk for developing cancer. The effects of detection probabilities, symptomatic presentation of tumors, tumor volume doubling time, and time between screenings were examined. Computed tomography (CT) and chest radiography models were used.

RESULTS: For imperfect detection probabilities, the percentage of subjects with cancers detected with repeated screenings decreased to a steady-state value. The transition period was the period during which screenings were performed and detection rates decreased. At steady-state repeat screening, the proportion of subjects with cancers diagnosed at screening or by means of symptomatic presentation was determined by the annual probability of developing cancer and not by the sensitivity of the screening modality. The sensitivity of the screening technique did affect detected cancer size, number of interval cancers, and total number of cancers observed. CT was used to detect more total cancers over the course of the screening program and cancers with a smaller average size; moreover, fewer interval cancers were observed with CT screening than with chest radiography screening.

CONCLUSION: Lung cancer screening with imperfect detection has a transition period between baseline screening and steady-state behavior of annual screenings. Advantages of CT screening include a decrease in the average cancer size at detection, a decrease in the number of observed interval cancers, and an increase in the total number of cancers observed. Steady-state behavior indicates that long-term trials of screening may not be necessary.

© RSNA, 2005


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 APPENDIX
 REFERENCES
 
Lung cancer has become the most common cause of cancer-related death in the world (1). Radiologic screening of individuals with increased risk for developing lung cancer has been advocated as a means of detecting cancer at an earlier and possibly more curable stage; early detection might lead to more effective treatment, with a corresponding reduction in morbidity and mortality associated with the disease (24). Screening on the basis of the findings of chest radiography has shown only limited benefit, partly because of the low detection probability and large size of detected cancers (57). Poor detection with chest radiography is usually attributed to the large amount of structural noise (ie, the superposition of normal structures) (7). Computed tomography (CT) greatly reduces the structural noise of an image. Multi–detector row helical CT provides a rapid, accurate, and robust alternative radiologic technique for lung cancer screening (3). Since CT is sensitive to smaller cancers, a greater proportion of cancers detected in the screening population areasymptomatic and potentially more curable than those detected with chest radiography (4,712).

A perfect screening modality would be able to depict any cancer with a probability of one. In reality, all screening modalities have imperfect sensitivities, in that they can only depict cancers larger than some action size (ie, the minimum tumor diameter for which the radiologist will record a positive or semipositive finding [13]). Tumors that are larger than the action size require different follow-up than tumors that are smaller than the action size, and the probability of detecting cancers larger than the action size is less than one. These detection imperfections may be due to the technology, the observer, or both. Imperfections affect the temporal dynamics of a cancer screening program, ranging from the number of cancers detected at baseline screening, the duration of the transition period (ie, the period between baseline screening and steady-state screening during which the percentage of screening subjects with detected cancers at each screening is decreasing), and the size (hence, the potential curability) of cancers observed.

Lung cancer screening can be viewed as consisting of four phases. The first phase consists of disease development prior to screening. The second phase consists of baseline screening, when asymptomatic subjects are imaged. The third phase consists of transition screenings, when the percentage of subjects with detected cancers gradually decreases during repeated screening until a steady state is reached. The fourth phase consists of a steady-state condition, during which any repeated screening behaves largely the same. These phases are described in more detail in the Appendix.

Mathematic modeling of cancer screening has been presented by other researchers (1416). On the basis of results of screening programs conducted in the 1980s, Flehinger and Kimmel (16) modeled lung cancer screening and focused on the effect on mortality. To our knowledge, the temporal dynamics of a lung cancer screening program based on a more sensitive test, such as multi–detector row helical CT, have not been described in detail. These dynamics are of interest in the design of screening trials in which the number of positive cases during the study must be estimated.

Thus, the purpose of our study was to use a mathematic model to demonstrate the effect of imperfect detection on the temporal dynamics of radiologic lung cancer screening programs.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 APPENDIX
 REFERENCES
 
Mathematic Model and Assumptions
Cancer development and growth.—After reviewing the findings of Henschke et al (17), we assumed that the high-risk screening population had an annual probability of .005 (five per 1000 subjects) of developing cancer. Cancers were assumed to be spheres, and they were modeled as initiating randomly through time with a uniform distribution. The initial diameter of the cancer was assumed to be 2 µm; this diameter represents smaller human cells, which typically range in diameter from 2 to 200 µm (18). Cancer growth was modeled according to the report of Hasegawa et al (19) as V(t) = V02t/VDT, where V0 is the volume corresponding to the initial radius, t is the time in days, and VDT is the volume doubling time in days. For any given volume, the diameter of the tumor (d) was estimated by using the formula for the volume of a sphere (V) as follows: d = (6V/{pi})1/3.

Published values for VDT vary. Usuda et al (20) reported a mean VDT of 163.7 days. By contrast, Hasegawa et al (19) reported a mean VDT for adenocarcinomas of up to 452 days; however, their study included subtypes that were not typical solid tumors. In a review of the Mayo Lung Project and the Memorial Sloan-Kettering Cancer Center lung screening data, Yankelevitz et al (21) reported mean VDT values of 101 and 144 days, respectively. They suggested that reported mean VDT values of more than 400 could be due to overdiagnosis. In our simulation, we chose a middle ground between the Mayo Lung Project and Memorial Sloan-Kettering studies; thus, we modeled VDT as a Gaussian distribution with a population mean of 125 days (standard deviation, 30 days). The standard deviation was chosen to approximate the minimum doubling times observed in these studies. We did not attempt to model the effect of a skewed tail resulting from slow-growing cancers. Because we used a Gaussian function, there is a very small probability that a cancer will have a computed negative VDT value. We do not explicitly address this issue; however, we assume that these few cancers are actually initiated but never grow.

When cancers reached a critical diameter, they were assumed to become symptomatic; patients with such cancers were removed from the screening population. The actual size when a cancer becomes symptomatic is not always well defined. Cancers that are detected between screenings in symptomatic subjects are referred to as interval cancers. Not all interval cancers are symptomatic, however, because interval cancers also include cancers detected incidentally during other diagnostic studies that are not part of the screening. In the chest radiography screening study performed at Memorial Sloan-Kettering Cancer Center (New York Screening Program), Heelan et al (22) found that interval cancers have a mean size of 39 mm (range, 11–93 mm). These interval cancers included all incidental cancers found with radiologic studies performed for any reason and all cancers in patients who presented with symptoms. By contrast, the mean size of cancers detected with the actual screening program was 24 mm. For this simulation, we chose a threshold of 40 mm for symptomatic presentation. We did not model the effect of smaller cancers being detected incidentally during other imaging studies. We did not model other causes of death or removal from the screening program; however, we assume that these would be equally distributed throughout all patients (ie, both those with and those without cancer) in the study and would not affect the proportions we are examining.

Cancer detection.—To simplify the model, we assumed that cancer detection was independent from year to year; namely, the availability of results of prior studies did not affect detectibility. We modeled cancer detection by using several different functions. Cancer detection with CT was based on the experience at Weill Medical College of Cornell University, New York, NY, in the Early Lung Cancer Action Project (Henschke CI, unpublished data, 1999). We used the following tumor diameters to define CT detection probability (pdet):

{r05fe03e01}

From cancer detection with chest radiography in the same data set, we modeled the chest radiography detection function as follows:

{r05fe03e02}

Finally, for the purpose of illustration, we examined a simple detection function:

{r05fe03e03}
where the probability of detecting cancers larger than the action diameter (da) is not zero (pd) and zero for all cancers smaller than the action diameter.

For these simulations, we set the action diameter to be 2 mm, which was the size of the smallest detectable cancer in the CT model.

When cancer was observed in a subject, that subject was removed from the screening population. Thus, as the course of the screening program progressed, the number of screening subjects decreased.

Initial conditions.—Initial conditions for the screening population were created by simulating an initially cancer-free population for 20 years without a screening intervention. The initial population size was 100 000 patients, and the annual probability of developing cancer was .005. In all the screening simulations, multiple cancers within a patient were not modeled. Patients who developed cancers larger than the symptomatic size were removed from the population. At the end of 20 years, 10 000 subjects were randomly selected from the remaining pool of patients without symptoms to undergo screening; baseline screening was performed to initiate the screening program. By 20 years, both the population cancer burden and the average cancer size had leveled off, which indicated that a reasonable approximation of a steady-state value had been reached (Fig 1). The population tumor burden continued to increase slowly, however, since symptomatic subjects were not replaced with tumor-free subjects.



View larger version (16K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1a. Graphs show (a) the percentage of subjects with asymptomatic cancers and (b) the mean size of asymptomatic cancers. By year 20, both the population tumor burden and mean cancer diameter have leveled off to steady-state values, indicating that the population is in an appropriate state to begin the screening simulation.

 


View larger version (17K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1b. Graphs show (a) the percentage of subjects with asymptomatic cancers and (b) the mean size of asymptomatic cancers. By year 20, both the population tumor burden and mean cancer diameter have leveled off to steady-state values, indicating that the population is in an appropriate state to begin the screening simulation.

 
Screening frequency.—We assumed an annual screening frequency.

Summary of model assumptions.—We have summarized some of the more important assumptions of the models. First, we assumed sharp thresholds for both the action size and the size when a cancer becomes symptomatic. Second, we assumed that each screening was independent (ie, results of prior studies were not examined). Third, we assumed that there were no deaths or reasons to be removed from the screening program other than detection of or manifestation of lung cancer. Fourth, we assumed that cancer incidence remained constant over the course of the screening program. Fifth, we assumed a Gaussian distribution of VDT. Our model was intended to be a simple first-order approximation.

Monte Carlo Simulations
The simulation was written with Interactive Data Language, version 5.6 (Research Systems, Boulder, Colo). For each case, 3000 runs of the model were computed. The reported values represent the mean value across the 3000-run output for each set of input parameters. We performed baseline simulations for the constant, CT, and chest radiography detection functions. For these baseline simulations, we assumed an incidence of 0.005, a VDT with a mean Gaussian distribution of 125 days ± 30, and a critical diameter of 40 mm.

Sensitivity Analysis
We performed sensitivity analysis to assess the dependence of our model on assumed parameter values for critical diameter, VDT, screening interval, detection probabilities, initial cancer size, and cancer incidence.

We varied critical diameter between 30 and 60 mm to examine the dependence of the model on the value of critical diameter. This range is narrower than the range reported by Heelan et al (22) because we modeled only the average symptomatic group and because we did not include incidental cancers. We varied the mean VDT from 100 to 150 days and kept the standard deviation of VDT constant at 30 days. We varied the screening intervals from one screening every 4 months to one screening every 2 years. We examined the effect of different detection probabilities on behavior during the steady state. We adjusted the detection functions by shifting the detection probabilities for the cancer diameter by 1 mm. That is, we modified Equations (1) and (2) to use the given detection probability for a cancer with a smaller or larger diameter. We also varied the initial cancer diameter (2–20 µm). Finally, we varied the underlying cancer incidence (0.3%–0.7%). For each of these analyses, we used baseline values for the fixed parameters.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 APPENDIX
 REFERENCES
 
In this section, we report the expected mean values (computed from the 3000 repeated simulations) of the variables examined rather than the actual values for any specific simulation. The computed values for a particular simulation are integers (ie, number of interval cancers), but these are reported here in expected values. In addition, because the results are based on large simulated populations (100 000 subjects for the start-up population and 10 000 subjects for the screening population) and 3000 repeated realizations of the simulation, the standard errors of the estimated values are 0.02%–0.10% for the larger values (ie, number of screened subjects and detected cancers) and 2%–4% for the smaller values (ie, number of interval cancers).

At the initiation of the screening program, the expected tumor burden (ie, percentage of screening population with cancer of any size) was 7.1% (708 of 10 000 patients). These cancers had an average diameter of 4 mm, an average duration of 7.8 years, and an average VDT of 132 days (Fig 2). The shift in VDT distribution to 125 days is due to more aggressive cancers becoming symptomatic earlier and being removed from the start-up population prior to the initiation of screening. Note that the histograms in Figure 2 include all cancers present (ie, detected and undetected) in the population.



View larger version (25K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2. Graph shows distribution of VDT values for cancers in the screening population at baseline screening and for cancers that develop after baseline screening. VDT values (mean, 125 days ± 30) are generated from a Gaussian distribution. During the start-up period, tumors with shorter VDT values were more likely to become symptomatic, producing a positive shift in VDT values for tumors present at the onset of screening. Tumors that developed during the screening period were not affected. Because we used a Gaussian distribution, about two of every 100 000 cancers had a nonphysical negative VDT value. Such cancers were assumed to initiate but never grow.

 
We describe the year-to-year behavior of the screening program in terms of the expected percentage of screening subjects with cancers detected with radiography, the expected percentage of subjects with observed cancers, and the average size of detected cancers. We only plot the baseline and first five repeat screenings so that the details of the transition periods can be appreciated. We show percentages rather than actual numbers because the number of screening subjects changes as the study progresses.

For constant detection functions (Fig 3), we observed a decrease in the number of cancers detected at baseline and an increase in the duration of the transition period as the detection probability decreased (Fig 3a). There was also a slight decrease in the percentage of patients with cancers detected with radiology during the steady state. Ultimately, however, the total percentage of subjects with observed cancers (ie, interval cancers or cancers detected with screening) is identical, regardless of detection probability. (Fig 3b). As detection probability decreases, the mean diameter of detected cancers increases noticeably (Fig 4).



View larger version (30K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3a. Graphs show the percentage of screening subjects with (a) cancer detected with radiologic screening for three constant detection functions and (b) observed cancers. Zero indicates baseline screening. The action size for these functions was 2 mm. The percentage of observed cancers in the steady state is the same for all three functions. As the detection rate decreased, the percentage of subjects with cancers detected at baseline decreased. Missed tumors resulted in a transition period between baseline screening and steady-state screening; a larger percentage of cancers were detected with poorer performing techniques during transition screenings.

 


View larger version (32K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3b. Graphs show the percentage of screening subjects with (a) cancer detected with radiologic screening for three constant detection functions and (b) observed cancers. Zero indicates baseline screening. The action size for these functions was 2 mm. The percentage of observed cancers in the steady state is the same for all three functions. As the detection rate decreased, the percentage of subjects with cancers detected at baseline decreased. Missed tumors resulted in a transition period between baseline screening and steady-state screening; a larger percentage of cancers were detected with poorer performing techniques during transition screenings.

 


View larger version (26K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4. Graph shows mean diameters of detected cancers for the constant detection functions. Zero indicates baseline screening. Action size for detection was 2 mm. As detection rate decreased, mean diameter of detected cancers increased because of additional growth cycles prior to detection.

 
Relative to CT screening, chest radiography screening was used to detect substantially fewer cancers at baseline and in the steady state, and an identical percentage of total cancers was observed in the steady state (Fig 5). To clarify the presentation of the different expected percentages, we show the actual number of subjects who are screened each year with CT and chest radiography (Fig 6). Mean diameter of detected cancers decreased from the baseline value to the steady-state value for both CT and chest radiography, with a larger decrease for CT (Fig 7). For CT, the mean VDT of cancers detected at baseline screening was 132 days, and the mean VDT of steady-state cancers detected at annual screening was 126 days. For chest radiography, the mean VDT of cancers detected at baseline screening was 131 days, and the mean VDT of steady-state cancers detected at annual screening was 132 days. In the chest radiography screening population, observed and unobserved cancers had a steady-state mean VDT of 129 days. Note that the shift in the overall population VDT values is smaller for chest radiography than for CT.



View larger version (24K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 5a. Graph shows the percentage of screening subjects with (a) cancers detected with CT and chest radiography (CXR) and (b) observed cancers. While chest radiography detected cancers in a smaller percentage of the population each year, during the steady state, the same percentage of patients had observed cancers in both CT and chest radiography screening.

 


View larger version (23K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 5b. Graph shows the percentage of screening subjects with (a) cancers detected with CT and chest radiography (CXR) and (b) observed cancers. While chest radiography detected cancers in a smaller percentage of the population each year, during the steady state, the same percentage of patients had observed cancers in both CT and chest radiography screening.

 


View larger version (21K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 6. Graph shows total number of screening subjects per year undergoing CT and chest radiography (CXR). Zero indicates baseline screening. As cancers are detected, patients are removed from the screening population. Because of the high sensitivity of CT, the screening population size decreases more rapidly after the baseline examination. In subsequent years, the screening population decreases at a similar rate for both CT and chest radiography. Because the available population is continually decreasing, steady-state behavior is demonstrated by looking at percentages.

 


View larger version (18K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 7. Graph shows mean diameters of cancers detected with CT and chest radiography (CXR) during each screening. Zero indicates baseline screening. For both CT and chest radiography, mean diameter of cancers detected with screening decreases to a steady-state value.

 
Chest radiography is used to detect cancers in a later and larger state CT. The mean duration of cancer detected in the steady state with CT is 11 years, whereas the mean duration of cancer detected in the steady state with chest radiography is 14 years. The cumulative number of cancers detected with screening diverges between CT and chest radiography, but the total number of cancers observed reaches a plateau, with the difference between CT and chest radiography being essentially the difference in the number of cancers detected during baseline screening (Fig 8).



View larger version (24K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 8. Graph shows difference between the cumulative number of cancers detected with CT and chest radiography (CXR). Screening populations consisted of 10 000 asymptomatic subjects at baseline. Zero indicates baseline screening. The difference in the number of cancers observed at screening with CT and chest radiography is relatively constant over time and differs primarily by the number of cancers observed at baseline screening. The difference between the number of cancers detected with CT and the number of cancers detected with chest radiography, however, steadily increases over the course of the screening program.

 
Critical diameter.—During the steady state, the expected percentage of observed cancers that was made up of interval cancers shows an exponential dependence on critical diameter, varying from 0.13% (0.0607 of 45.000 cancers) to 0.025% (0.011 of 44.581 cancers) for CT and from 7.6% (3.485 of 45.575 cancers) to 33.7% (15.267 of 45.317 cancers) for chest radiography. Over the same range of critical diameter, the expected percentage of patients with cancers detected at baseline screening ranges from 1.7% (168.37 of 10 000 patients) to 2.2% (215.55 of 10 000 patients) for CT and from 0.41% (40.73 of 10 000 patients) to 0.79% (78.87 of 10 000 patients) for chest radiography. More baseline cancers are detected with CT than with chest radiography because CT has greater sensitivity, particularly in the detection of small cancers. The mean size of cancers detected with CT in the steady state did not change with the change in critical diameter; however, the mean size of cancers detected with chest radiography in the steady state changed from 18.7 to 24.5 mm with the change in critical diameter.

VDT.—As the mean VDT varied from 100 to 150 days, the expected percentage of interval cancers observed during steady state ranged from 0.51% (0.228 of 44.958 cancers) to 0.0045% (0.002 of 44.632 cancers) for CT and from 26.2% (11.9023 of 45.4323 cancers) to 11.1% (5.0140 of 45.3240 cancers) for chest radiography. Over the same range of VDT values, the expected percentage of subjects with cancers that were detected at baseline ranged from 1.5% (152.94 of 10 000 patients) to 2.1% (212.87 of 10 000 patients) for CT and from 0.47% (46.79 of 10 000 patients) to 0.61% (60.62 of 10 000 patients) for chest radiography. As VDT increases, mean size of detected cancer in the steady state decreases from 5.3 to 3.8 mm for CT and from 22.5 to 20.9 mm for chest radiography. The change in expected size of the cancers detected at baseline screening was very small.

Screening interval.—When screening was performed at 4-month intervals rather than at yearly intervals, the expected percentage of observed cancers that were interval cancers decreased from 0.075% (0.0333 of 44.6133 cancers) to 0.0082% (0.0013 of 15.8013 cancers) with CT and from 16.8% (7.599 of 45.109 cancers) to 0.75% (0.1203 of 16.0703 cancers) with chest radiography. When the screening interval was extended to 2 years, 1.49% (1.2143 of 81.5743 cancers) of cancers observed with CT were interval cancers, while 46.4% (38.5103 of 83.0803 cancers) of cancers observed with chest radiography were interval cancers. Over the range of screening intervals, the mean size of detected cancers ranged from 2.6 to 7.7 mm for CT and from 15.5 to 23.5 mm for chest radiography. The mean tumor duration of detected cancers ranged from 10.56 to 11.86 years with CT and from 13.13 to 14.43 years with chest radiography.

Detection probability.—When the CT scanner was made sensitive to cancers 1 mm smaller, the average diameter of a detected cancer decreased from 4.3 to 4.2 mm, and the expected proportion of observed cancers that were interval cancers decreased to 0.06% (0.0263 of 45.0363 cancers). When the CT scanner was made sensitive to cancers 1 mm larger, the average diameter of a detected cancer increased to 6.2 mm, and the expected percentage of observed cancers that were interval cancers increased to 0.1% (0.0477 of 44.9677 cancers). When chest radiography was made sensitive to cancers 1 mm smaller, the average diameter of a detected cancer decreased from 21.8 to 20.4 mm, and the expected percentage of observed cancers that were interval cancers decreased to 14.5% (6.567 of 45.187 cancers). When chest radiography was made sensitive to cancers 1 mm larger, the average diameter of a detected cancer increased to 23.0 mm, and the expected percentage of interval cancers increased to 19.3% (8.7707 of 45.3907 cancers).

Initial cancer size.—As the initial cancer diameter increased from 2 µm to 10 µm and 20 µm, the mean duration of a cancer detected with CT during the steady state decreased from 11.0 to 8.7 and 7.7 years, respectively. For chest radiography, the mean duration decreased from 14.0 to 11.5 and 10.5 years, respectively. There were no changes in the number of cancers detected at baseline screening, the average size of a detected cancer at steady state, or the proportion of interval cancers at steady state.

Cancer incidence.—The expected percentage of patients with cancers detected with baseline screening ranged from 1.1% (113.67 of 10 000 patients) to 2.6% (262.54 of 10 000 patients) for CT and from 0.34% (34.31 of 10 000 patients) to 0.80% (79.72 of 10 000 patients) for chest radiography. Neither the expected proportion of observed cancers that were interval cancers nor the average size of detected cancers changed during the steady state.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 APPENDIX
 REFERENCES
 
These simulations show that when cancer detection probability for a lung cancer screening program is imperfect, the proportion of subjects with detected cancers has a transition period from the prevalence value to the steady-state value. In the steady state, the annual percentage of observed cancers (ie, the sum of interval cancers and cancers detected with radiology) equals the annual incidence. The transition is due to cancers that are missed at baseline screening. The duration of the transition period depends on the detection performance. For the detection rates examined in this study, the transition period may require several screenings. The CT model reached an essentially steady-state condition at the third screening. During the transition period, modalities with poorer detection rates actually detect more cancers, and the cancers that are detected have a larger average diameter. The poorer the detection rate is, the longer the transition period will be. The implication is that several repeated screenings may need to be performed before it can be assumed that a steady state has been reached.

The percentage of subjects with observed cancer in the steady state is ultimately determined by the incidence of new cancers that develop each year and is independent of the screening modality. By using constant detection functions, we demonstrated that lower detection probabilities resulted in increased size and hence, increased stage of detected cancers, as the cancers had additional screening cycles to grow.

This behavior was also illustrated with the CT and chest radiography detection probabilities. The total percentage of observed steady-state cancers in the screening population is the same for CT and chest radiography. Because chest radiography tends to detect larger cancers than does CT, however, a greater percentage of steady-state cancers are interval cancers. Also, because CT detects more cancers than does chest radiography at baseline screening, the screening population is smaller in the steady state, which reduces the actual number of cancers observed during each screening interval.

These simulations showed that CT screening has the following advantages over chest radiography: First, more total cancers will be observed with a CT-based screening program than with a chest radiography screening program. This difference in observed cancers is a result of the fact that more cancers are detected with CT than with chest radiography at baseline screening. Even though the same percentage of screening subjects who undergo chest radiography and CT have observed cancers in the steady state, chest radiography never catches up with CT. Second and more important, because CT is more sensitive than chest radiography in the detection of small cancers, CT detects cancers at an earlier stage of their development; this results in a substantial reduction in the number of interval cancers and potentially reduces the mortality associated with this disease. Because of its high sensitivity, CT can result in almost no symptomatic presentation of cancers if the screening interval is short enough relative to the tumor VDT; this was demonstrated in the sensitivity analysis for both VDT and the screening interval.

There are three important limitations to this study. First, we have presented a simple model, and our results are dependent on our model assumptions. It was not our intent to replicate the precise values observed in various screening programs. Rather, we wanted to use a simple numeric model to illustrate the oft-overlooked dynamics of a screening process. For example, our threshold for symptomatic cancers is an oversimplification. Published results show a large overlap between the average sizes of screening-detected cancers and interval cancers. Hence, we would not expect to achieve a close numeric match. We described cancer growth as a simple exponential function. Cancer growth may be modeled more accurately by using Gompertzian functions, which account for saturated growth rates demonstrated in larger tumors (2325). We do not believe that accounting for the saturated growth rates would affect the temporal dynamics, but we would see a change in the proportion of interval cancers. We also used a simple symmetric distribution for VDT; thus, we did not capture the effects of extremely slow-growing cancers. The detection probabilities we used were based on our own experience and may not reflect detection probabilities observed by others. Since we assumed that the performance and availability of prior studies did not affect detection probabilities in subsequent examinations, we probably underestimated detection during follow-up screenings. In reality, detection will vary with observer skill and the operating point selected for balancing sensitivity and specificity, as will be discussed later. We implicitly assume that the sensitivities are representative of the skill and operating point of the typical radiologist. The simulation is most dependent on detection probabilities, VDT, and critical diameter values. As these values decrease, the number of cancers detected at baseline screening decreases, and the number of interval cancers and mean size of detected cancers increases.

A second limitation of this study is that we did not account for mortality from other causes; therefore, we could not address overdiagnosis (21,26). The commonly presented view is that overdiagnosis occurs when cancer is diagnosed, but the subject dies of other causes before he or she dies of cancer. Overdiagnosis occurs most often at baseline and transition screenings, when very slow-growing cancers have grown to a detectable size prior to initiation of the screening program. With the common viewpoint, however, even fast-growing cancers can be considered overdiagnosed if the subject dies of other causes. Thus, overdiagnosis can occur throughout the screening program. Assuming that mortality rates are approximately constant throughout the screening program, however, overdiagnosis of a typical cancer will occur uniformly throughout the screening program and will not affect the relative proportions we have presented. Techniques such as CT, which are sensitive to small cancers, can potentially produce more cases of overdiagnosis than can techniques such as chest radiography, which are less sensitive to small cancers, because the cancers are detected earlier, thus increasing the temporal interval between diagnosis and death due to cancer. This increased temporal interval increases the probability that death will occur because of causes other than cancer after cancer has been detected.

A third limitation of this study is that we did not address the effect of specificity on the screening program. As recently pointed out by Hillman (27), the cost of unnecessary biopsies and other diagnostic tests as a result of false-positive findings can potentially exceed the cost of screening, thus reducing and potentially eliminating the benefits of screening. False-positive CT findings are believed to be of particular interest because CT is highly sensitive to small structures; hence, the potential cost-effectiveness of CT screening for lung cancer is controversial. In two recent reports in which this problem was evaluated (28,29), the researchers reached opposite conclusions. A model that allows lung cancer screening programs to be more fully evaluated would need to include the presence of benign lung nodules and account for the costs associated with procedures and interventions subsequent to screening.

The temporal dynamics we have illustrated have implications in the design of studies intended to demonstrate the benefit of lung cancer screening. In particular, our results indicate that with imperfect detection, the steady state should not be assumed at the first repeated screening. As the screening program proceeds, the presence of a steady state can be appreciated by plotting either the percentage of screening subjects with observed cancers or the average diameter of screening detection cancers or by using the approximate mathematic technique, as described in the Appendix. The steady state demonstrated with this model indicates that long-term trials of screening may not be necessary because repeated screenings produce nearly identical results after 2 or 3 years.


    APPENDIX
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 APPENDIX
 REFERENCES
 
Herein, we describe in greater detail the four conceptual phases of a lung cancer screening program.

Phase One: Disease Development
Prior to the initiation of screening, cancers develop in patients. As cancers become symptomatic, patients seek treatment and are no longer part of the potential screening population.

Phase Two: Baseline Screening
Once screening is initiated, we can view the asymptomatic cancers as being in one of three categories. The first category includes cancers that are smaller than the action size. These cancers either would not be observed or would be observed, but the radiologist would not recommend any action regarding the cancer. The second category includes cancers that are larger than the action size and are detected with the screening modality. The third category includes cancers that are larger than the action size but are missed with the screening modality. These categories persist throughout the screening program. If the probability of detection of cancers larger than the action size is one, then there are no missed cancers. The actual detection probability of the imaging modality is dependent on the physical characteristics of the imaging system, the criteria for interpreting the images, and the performance of the reader.

Phase Three: Transition Screenings
If the probability of detection (ie, sensitivity) for cancers larger than the action size was one, the screening program would achieve a steady-state condition immediately after baseline screening. If the imaging modality and observer have a sensitivity of less than one in the detection of cancers larger than the action size, however, some larger cancers will be missed with baseline screening. These cancers will persist in subjects for subsequent screening periods. The presence of these additional cancers results in a greater number of cancers that are observed during screening periods immediately after baseline screening than during later screening periods. For a given probability of detecting a cancer larger than the action size of pd, the probability of a cancer being missed is calculated by subtracting pd from one. Assuming a uniform detection probability for all cancers larger than the action size and assuming that each screening is independent of prior screenings, after a number (N) screenings, the probability of a cancer being missed is calculated with the following equation: (1 – pd)N. Although nonzero for all finite N and pd less than one, the detection probability eventually approaches a negligible value or the missed cancers become symptomatic and are removed from the screening population. The duration of the transition period can be estimated by computing which value of N leads to a negligible value for (1 – pd)N. In reality, pd increases as cancers become larger; this results in the probability of N misses approaching zero more rapidly than described previously, with a consequently shorter transition phase.

Phase Four: Steady-State Screenings
Assuming that the annual incidence of cancer remains constant in the screening population, after cancers that are present in the population before the initiation of the screening are detected (either at baseline screening or transition screenings) or present symptomatically (interval cancers), the percentage of screening subjects with cancers detected with periodic screening will approach a steady-state condition where an equal number of cancers are detected during each screening. Cancers that become symptomatic will be detected clinically during the interval between screenings (interval cancers). The percentage of screening subjects with interval cancers also becomes a constant for each screening period.


    FOOTNOTES
 
Abbreviation: VDT = volume doubling time

Authors stated no financial relationship to disclose.

Author contributions: Guarantor of integrity of entire study, B.E.C.; study concepts, all authors; study design, B.E.C., D.G.; literature research, B.E.C., D.F.Y., C.I.H.; data acquisition, B.E.C.; data analysis/interpretation, all authors; manuscript preparation, B.E.C.; manuscript definition of intellectual content, all authors; manuscript editing, B.E.C.; manuscript revision/review and final version approval, all authors


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 APPENDIX
 REFERENCES
 

  1. Levi F, Lucchini F, Negri E, La Vecchia C. Worldwide patterns of cancer mortality, 1990–1994. Eur J Cancer Prev 1999; 8:381-400.[Medline]
  2. Petty TL. Screening strategies for early detection of lung cancer: the time is now. JAMA 2000; 284:1977-1980.[Free Full Text]
  3. Henschke CI, Miettienen OS, Yankelevitz DF, Libby DM, Smith JP. Radiographic screening for cancer: proposed paradigm for requisite research. Clin Imaging 1994; 18:16-20.[CrossRef][Medline]
  4. Sone S, Takashima S, Li F, et al. Mass screening for lung cancer with mobile spiral computed tomography scanner. Lancet 1998; 351:1242-1245.[CrossRef][Medline]
  5. Fontana RS, Sanderson DR, Woolner LB, et al. Screening for lung cancer: a critique of the Mayo Lung Project. Cancer 1991; 67:1155-1164.[CrossRef][Medline]
  6. Soda H, Tomita H, Kohno S, Oka M. Limitation of annual screening chest radiography for the diagnosis of lung cancer. Cancer 1993; 72:2341-2346.[CrossRef][Medline]
  7. Li F, Sone S, Abe H, MacMahon H, Armato SG, 3rd, Doi K. Lung cancers missed at low-dose helical CT screening in a general population: comparison of clinical, histopathologic, and imaging findings. Radiology 2002; 225:673-683.[Abstract/Free Full Text]
  8. Kaneko M, Eguchi K, Ohmatsu H, et al. Peripheral lung cancer: screening and detection with low-dose spiral CT versus radiography. Radiology 1996; 201:798-802.[Abstract/Free Full Text]
  9. Henschke CI, McCauley DI, Yankelevitz DF, et al. Early lung cancer action project: overall design and findings from baseline screening. Lancet 1999; 354:99-105.[CrossRef][Medline]
  10. Sone S, Li F, Yang ZG, et al. Characteristics of small lung cancers invisible on conventional chest radiography and detected by population based screening using spiral CT. Br J Radiol 2000; 73:137-145.[Abstract]
  11. Sone S, Li F, Yang ZG, et al. Results of 3-year mass screening programme for lung cancer using mobile low-dose spiral computed tomography scanner. Br J Cancer 2001; 84:25-32.[CrossRef][Medline]
  12. Boiselle PM, Ernst A, Karp DD. Lung cancer detection in the 21st century: potential contributions and challenges of emerging technologies. AJR Am J Roentgenol 2000; 175:1215-1221.[Free Full Text]
  13. Henschke CI, Yankelevitz DF, Naidich DP, et al. CT screening for lung cancer: suspiciousness of nodules according to size on baseline scans. Radiology 2004; 231:164-168.[Abstract/Free Full Text]
  14. Zelen M, Feinleib M. On the theory of screening for chronic disease. Biometrika 1969; 56:601-614.[Abstract/Free Full Text]
  15. Shapiro S, Goldberg JD, Hutchinson GB. Lead time in breast cancer detection and implications for periodicity of screening. Am J Epidemiol 1974; 100:357-366.[Abstract/Free Full Text]
  16. Flehinger BJ, Kimmel M. The natural history of lung cancer in a periodically screened population. Biometrics 1987; 43:127-144.[CrossRef][Medline]
  17. Henschke CI, Naidich DP, Yankelevitz DF, et al. Early lung cancer action project: initial findings on repeat screening. Cancer 2001; 92:153-159.[CrossRef][Medline]
  18. Vander AJ, Sherman JH, Luciano DS. Human physiology: the mechanisms of body function 6th ed. New York, NY: McGraw-Hill, 1994; 40.
  19. Hasegawa M, Sone S, Takashima S, et al. Growth rate of small lung cancers detected on mass CT screening. Br J Radiol 2000; 73:1252-1259.[Abstract]
  20. Usuda K, Saito Y, Sagawa M, et al. Tumor doubling time and prognostic assessment of patients with primary lung cancer. Cancer 1994; 74:2239-2244.[CrossRef][Medline]
  21. Yankelevitz DF, Kostis WJ, Henschke CI, et al. Overdiagnosis in chest radiographic screening for lung carcinoma: frequency. Cancer 2003; 97:1271-1275.[CrossRef][Medline]
  22. Heelan RT, Flehinger BJ, Melamed MR, et al. Non-small-cell lung cancer: results of the New York Screening Program. Radiology 1984; 151:289-293.[Abstract/Free Full Text]
  23. Frenzen CL, Murray JD. A cell kinetics justification for Gompertz’ equation. SIAM J Appl Math 1986; 46:614-629.[CrossRef]
  24. Castro MA, Klamt F, Grienseisen VA, Grivicich I, Moreira JC. Gompertzian growth pattern correlated with phenotypic organization of colon carcinoma, malignant glioma and non-small cell lung carcinoma cell lines. Cell Prolif 2003; 36:65-73.[CrossRef][Medline]
  25. Calderón CP, Kwembe TA. Modeling tumor growth. Math Biosci 1991; 103:97-114.[CrossRef][Medline]
  26. Eddy DM. Screening for lung cancer. Ann Intern Med 1989; 111:232-237.
  27. Hillman BJ. CT screening: who benefits and who pays. Radiology 2003; 228:26-28.[Free Full Text]
  28. Mahadevia PJ, Fleisher LA, Frick KD, Eng J, Goodman SN, Powe NR. Lung cancer screening with helical computed tomography in older adult smokers: a decision and cost-effectiveness analysis. JAMA 2003; 289:313-322.[Abstract/Free Full Text]
  29. Wisnivesky JP, Mushlin AI, Sicherman N, Henschke C. The cost-effectiveness of low-dose CT screening for lung cancer. Chest 2003; 124:614-621.[Abstract/Free Full Text]



This article has been cited by other articles:


Home page
ChestHome page
P. B. Bach, G. A. Silvestri, M. Hanger, and J. R. Jett
Screening for Lung Cancer: ACCP Evidence-Based Clinical Practice Guidelines (2nd Edition)
Chest, September 1, 2007; 132(3_suppl): 69S - 77S.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Chapman, B. E.
Right arrow Articles by Gur, D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Chapman, B. E.
Right arrow Articles by Gur, D.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
RADIOLOGY RADIOGRAPHICS RSNA JOURNALS ONLINE