|
|
||||||||
Breast Imaging |
1 From The Elizabeth Wende Breast Clinic, 170 Sawgrass Dr, Rochester, NY 14620. From the 2001 RSNA scientific assembly. Received January 15, 2003; revision requested March 26; final revision received November 12; accepted January 5, 2004. Address correspondence to K.M.W. (e-mail: kmw@ewbc.com) or S.V.D. (e-mail: sdestounis@ewbc.com).
| ABSTRACT |
|---|
|
|
|---|
MATERIALS AND METHODS: At the authors institution, independent prospective double readings in which the second reader is not blinded to results of the first reading are performed routinely for all mammograms. When cancer is diagnosed, prior mammograms also are reviewed with double reading to determine cancer visibility. Findings are categorized as (a) no evidence of cancer on any prior screening mammogram and patient presents more than 1 year after prior screening, (b) no evidence of cancer on any prior screening mammogram and patient presents with symptoms within 1 year after prior screening (year-interval occult false-negative), or (c) cancer visible. The clinical director separately evaluates each case in the same way. In 2000, 519 histologically proved breast cancers were diagnosed, including 132 for which patients sought a second opinion and FN findings were not tracked. Prior screening mammograms were available in 318 of the other 387 cases. Five radiologists in two reading sessions independently reviewed current and prior mammograms to categorize visible cancers as either threshold or actionable FN findings. Visible cancers deemed actionable by at least three of five readers were analyzed with a commercially available CAD system. FN rates were calculated prior to and after CAD analysis.
RESULTS: Twenty-seven occult and 71 visible cancers were found (total FN findings, 98). Three of five readers considered 52 (73%) of 71 visible cancers actionable. The CAD system correctly marked 37 (71%) of these 52 on prior screening mammograms (19 [65%] of 29 masses, seven [88%] of eight microcalcifications, seven [78%] of nine architectural distortions, and four [67%] of six masses with microcalcifications). The FN rate was 98 (31%) of 318 before CAD and 61 (19%) of 318 after CAD.
CONCLUSION: In this retrospective review of this small subset of cancers, it appears that CAD has the potential to decrease the FN rate at double reading by more than one-third (from 31% to 19%). The CAD system correctly marked 37 (71%) of 52 actionable findings read as negative in previous screening years.
© RSNA, 2004
Index terms: Breast neoplasms, radiography, 08.11 Breast radiography, quality assurance Computers, diagnostic aid
| INTRODUCTION |
|---|
|
|
|---|
Interpretation of the mammogram by two radiologists (double reading) is one method to compensate for perceptual inconsistencies; results from several studies indicate a 4.6%15% increase in sensitivity with use of double as opposed to single reading (2124). Furthermore, cancers identified by a second reader are detected at an earlier stage (22). The additional cost of double reading, however, prohibits its wide use. Even with use of double reading, false-negative findings on mammograms persist (15,17).
Another method to reconcile perceptual problems is computer-aided detection (CAD) (2528). Previous investigations of CAD in the interpretation of screening mammograms demonstrated as much as a 19.5% increase in cancer detection with a single reading (17,29). To our knowledge, no study to date has determined the value of CAD in the setting of double reading. The purpose of this retrospective pilot study was to evaluate the role of a CAD system in reducing the rate of false-negative findings on screening mammograms that have been double read as normal.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Double reading of diagnostic mammograms is completed as follows: The assigned radiologist evaluates the patient and, if the radiologist discovers an abnormality, directs any subsequent tests toward deciding the course of action. All testing is completed online. The second reader independently reviews the mammogram and consults with the assigned first reader. The primary goal of the second reader in the diagnostic setting is to screen for problems overlooked by the first reader, not to reevaluate the problematic area, although opinions may be sought.
False-Negative Rates
We have tracked outcomes rigorously since our clinic began operation in 1976. To determine false-negative rates, we include not only cancers diagnosed within 1 year of a mammogram interpreted as negative but also cancers that are visible on any prior mammogram. We routinely perform a retrospective review of prior mammograms (diagnostic or screening) for all patients in whom cancer is diagnosed. When cancer is diagnosed, the two initial readers review both the current mammogram and prior screening mammograms to determine whether cancer is visible on any prior study. Cancers that are visible are documented. If the cancer is not visible but less than 1 year has elapsed since the most recent prior screening, we consider the cancer a year-interval missed lesion and document it as an occult false-negative finding. Separately and independently, the director of the clinic (W.L.Y.) also evaluates all cancers in the same manner. Any cancer deemed visible on mammograms obtained in prior years is documented, whether one, two, or three radiologists agree or not.
Twice a year, in a group conference, five radiologists reevaluate current and prior mammograms of interval occult cancers and visible cancers, all of which are considered false-negative findings, to increase each radiologists experience and knowledge base. For purposes of this study, these group conferences were replaced by two reading sessions performed by each of five radiologists (S.V.D., P.D.N., W.L.Y., E.B., M.L.Z.) to subcategorize the visible cancers as threshold false-negative or actionable false-negative findings. Threshold cancers are defined as pathologically proved cancers that were found at diagnostic or screening mammography in the current year and that were visible on a previous screening mammogram but were below the threshold for concern. Either these cancers were not acted on, or they were judged benign or probably benign on the basis of work-up (which may have included the acquisition of additional mammographic views, as well as physical examination, ultrasonography, fine-needle aspiration biopsy, and/or needle core biopsy). Threshold false-negative findings are considered to result from errors in judgment, biopsy technique, histopathologic evaluation, or technical limitations of the mammographic examination. Actionable findings include any pathologically proved cancer that, after independent review, was considered by three of five radiologists to be both evident and actionable on any prior screening study. We defined as actionable those cases in which mammographic evidence was strong enough to have warranted further work-up but for which no work-up was completed in any prior year. Actionable false-negative findings are considered to result from errors in perception.
Selection of Cases for CAD System Analysis
Institutional review board approval was not required for this study. Informed consent is not required by our institution for retrospective review of data. In 2000, a total of 519 histologically proved cancers (Table 1) were diagnosed at our facility (175 at diagnostic and 344 at screening mammography). In 132 of these 519 cases, the patient sought a second opinion, and false-negative findings in these 132 cases were not tracked. Prior screening mammograms had been obtained at our facility for 318 (82%) of the other 387 cases. These 318 cancers were detected either at diagnostic imaging performed because of symptoms that occurred within 1 year after prior screening (n = 105) or at follow-up routine screening mammography (n = 213). Prior screening mammograms in some cases predated the current mammograms by more than 1 year. The initial evaluation process, with the double reading and with review by the director of the clinic, yielded findings of visible disease on prior screening mammograms for 71 cancers (Table 2). Five radiologists (S.V.D., P.D.N., W.L.Y., E.B., M.L.Z.) independently reviewed the current and prior mammograms of these 71 cancers. Three of the five radiologists determined independently that 52 (73%) of the 71 mammograms (mean patient age, 65 years; age range, 4289 years) showed abnormalities that could have been acted on (actionable false-negative findings). Our study was focused on the 52 cancers that were actionable on any prior screening study, regardless of whether they had been detected because of symptoms or at routine screening.
|
|
CAD Analysis
Current mammograms and prior screening mammograms of the 52 actionable cancers were analyzed by using a CAD device (Image Checker V2.2; R2 Technology, Sunnyvale, Calif). The current and previous mammograms were reviewed by using a viewer (RAD-x 604; R2 Technology) equipped with two 9-inch gray-scale monitors, which displays the scanned mammograms with CAD analysis results. The display reveals the location of CAD marks and indicates morphologic type. It is not intended for soft-copy interpretation.
Overall, a cancer was counted as detected if the CAD system marked the lesion site with the appropriate mark on either view of the current or any prior mammogram. The CAD system sometimes marked the cancer on one but not all prior mammograms, on a prior mammogram but not the current mammogram, or on the current mammogram but no prior mammogram.
The following parameters were calculated and documented: number of cancers marked with appropriate marks on the current mammogram, number of cancers marked with appropriate marks on mammograms obtained 1 year or less before the current mammogram, and the number of cancers marked with appropriate marks on mammograms obtained more than 1 year prior to the current mammogram.
Distribution of morphologic presentation among marked cancers also was recorded. The total number of marks per case and the number of marks that correctly indicated cancer per case were documented.
Determination of False-Negative Rates
False-negative rates were determined, prior to CAD analysis of the 52 actionable false-negative findings, by using the total number of breast cancers diagnosed in 2000 (excluding those diagnosed through a second opinion) for which prior mammograms obtained at our clinic were available. False-negative rates were calculated as follows: The number of all (occult, threshold, and actionable) false-negative findings was divided by the total number of cancers, and the contribution of each category (occult, threshold, or actionable) to the false-negative rate was divided by the total number of cancers. In addition, we determined the false-negative rate by using the guidelines issued by the ACR (29) for a practice audit, which includes all occult false-negative findings and diagnostic (year-interval) threshold or actionable false-negative findings only, but excludes actionable and threshold false-negative findings diagnosed more than 1 year after screening. After the CAD system analysis of the 52 actionable findings was complete, the false-negative rates were recalculated with the assumption that all CAD-marked cancers could have been detected on a prior mammogram.
| RESULTS |
|---|
|
|
|---|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
Assessment of false-negative findings, especially when used as a learning tool, can help expand the boundaries of what is detectable. Accurate determination of false-negative rates is complicated. Bias inherent in retrospective review can result in rates that are under- or overestimated. Variation in audit practices and discrepancy in what constitutes a false-negative finding also result in reporting inaccuracies. Further discrepancies in false-negative rates may occur because previously screened patients with a later diagnosis of breast cancer may obtain subsequent screening and/or diagnostic work-up at different facilities or may proceed directly to the surgeon; in such cases, radiologists are predominantly not informed of false-negative findings. Because Rochester, NY, has a stable population, our clinic has multiple previous mammograms for most patients. This permits us to track false-negative findings more closely than most clinics can. For 2000, we have at least one prior mammogram for 318 (82%) of 387 cancers, a fact that allowed us to perform a more accurate assessment of the false-negative rate. In addition, we have established long-standing mechanisms in our community that more effectively enable us to obtain the results of pathologic analysis even when we have not initiated the biopsy.
The ACR (31) defines a false-negative finding as "diagnosis of cancer within 1 year of a mammographic examination with normal or probably benign findings (BI-RADS [Breast Imaging and Reporting Data System] category 1, 2, and 3)." The Final Rules of the Mammography Quality Standards Act (32) require a systematic practice audit. Although the Mammography Quality Standards Act does not define how a facility should perform a practice audit, it provides clear instruction about the reporting of false-negative mammographic findings: "Facilities must also include in their audit any patients that they become aware of who were subsequently found to have cancer that was not detected through their mammogram."
Herein lies the greatest debate over false-negative values. If the ACR definition of false-negative findings is applied, our clinics false-negative rate for 2000 is 41 (13%) of 318 cancers (Table 6), including 27 (9%) that were mammographically occult and 14 (4%) that were year-interval missed cancers (five threshold and nine actionable false-negative findings) visible on prior mammograms.
However, our clinic reviews the prior mammograms of all patients with cancer, whether the mammogram was obtained less than or more than 1 year prior to cancer diagnosis. Using our method of review for false-negative findings, we identified another 57 (18%) of 318 cancers (14 threshold and 43 actionable false-negative findings) that were retrospectively visible on previous screening mammograms. We believe that this is the best method to determine the true false-negative rate. Admittedly, there is some subjectivity in deciding whether the earlier appearance of tumors is actionable. If the goal is to enhance the interpretive proficiency of the radiologist and to decrease perceptual deficiencies, then the information obtained by reviewing these additional cancer cases must also be considered.
Since 1995, the staff at our clinic have performed a double reading of all mammograms. An internal audit for 19962000 revealed a 9% increase in the number of detected cancers as a result of the second reading (86 of 938 cancers detected in those years). This result is similar to previously reported statistics of 4.6%15% (2124). Because our office and staff are located at a single centralized facility, two radiologists are always available for double reading. Our recall rate is higher than those at facilities at which only one physician interprets the mammogram, because either of our radiologists can recall the patient. However, the current lack of trained mammographers is stretching our staff to its limits. Whether CAD can replace the second reader remains unclear, and the question is in need of further research.
It appears from our retrospective review that CAD has the potential to decrease the false-negative rate. Fifty-two cancers were considered by our facility to be false-negative findings, although 43 of the 52 were detected at follow-up screening. Thirty (58%) of these 52 cancers were considered minimal cancers (ductal carcinomas in situ or
10-mm invasive carcinomas); however, three (7%) of 40 patients who underwent lymph node dissection had lymph node metastases. The CAD system correctly marked 37 (71%) of 52 cancers on prior mammograms, with 11 (21%) of the 52 cancers marked on mammograms acquired more than 1 year prior to diagnosis. The system also correctly marked all three cancers with lymph node involvement on prior mammograms. In one case of lymph node metastasis, the CAD system marked the cancer on a mammogram acquired more than 2 years prior. Twenty-eight (76%) of the 37 cancers detected by the CAD system were invasive and had a mean size of 12 mm (range, 525 mm) at detection, compared with a mean size of 8 mm (range, 217 mm) on the oldest CAD systemmarked prior mammogram.
It is impossible to determine retrospectively whether the CAD system would prompt either radiologist in the prospective double reading to detect a cancer. Also, it is impossible to expect that increases in detection rates with CAD would come without increased cost. Of 218 CAD system marks, only 75 designated cancers. It is likely that false marks would lead to an increase in recall rates, radiologists workload, and overall operating expenses. However, Warren-Burhenne et al (17), in their retrospective review, found an overall decrease in recall rates (<1%), and Freer and Ulissey (29), in their prospective study, found an increase in recalls of a little more than 1%, no change in positive predictive value, and an overall increase in sensitivity of 19.5%. These results may indicate the quality of CAD system marks and/or the ability of the radiologist to distinguish false-positive marks from high-quality actionable marks.
It is also possible that CAD may decrease the number of findings in the threshold false-negative cohort. Because of subjective interpretation and wide variation in recall rates, CAD may influence recall of borderline cases. This can only be determined in a prospective manner.
In this retrospective review of this small subset of cancers, it appears that CAD has the potential to decrease the false-negative rate at double reading by more than one-third (from 31% to 19%). The CAD system correctly marked 37 (71%) of 52 actionable findings read as negative in previous screening years. What could not be determined retrospectively were the associated changes in recall rates and the effect on the positive predictive value. A prospective study is warranted and is now under way at our institution to measure and evaluate the effect of the addition of CAD to independent double interpretation of screening mammograms.
| FOOTNOTES |
|---|
Author contributions: Guarantor of integrity of entire study, S.V.D.; study concepts and design, all authors; literature research, S.V.D., K.M.W.; clinical studies, S.V.D., P.D.N., W.L.Y., E.B., M.L.Z.; data acquisition and data analysis/interpretation, all authors; manuscript preparation and definition of intellectual content, all authors; manuscript editing, S.V.D., W.L.Y., K.M.W.; manuscript revision/review and final version approval, all authors
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Gromet Comparison of Computer-Aided Detection to Double Reading of Screening Mammograms: Review of 231,221 Mammograms Am. J. Roentgenol., April 1, 2008; 190(4): 854 - 859. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Kakeda, Y. Korogi, H. Arimura, T. Hirai, S. Katsuragawa, T. Aoki, and K. Doi Diagnostic Accuracy and Reading Time to Detect Intracranial Aneurysms on MR Angiography Using a Computer-Aided Diagnosis System Am. J. Roentgenol., February 1, 2008; 190(2): 459 - 465. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Georgian-Smith, R. H. Moore, E. Halpern, E. D. Yeh, E. A. Rafferty, H. A. D'Alessandro, M. Staffa, D. A. Hall, K. A. McCarthy, and D. B. Kopans Blinded Comparison of Computer-Aided Detection with Human Second Reading in Screening Mammography Am. J. Roentgenol., November 1, 2007; 189(5): 1135 - 1141. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. F. Brem Blinded Comparison of Computer-Aided Detection with Human Second Reading in Screening Mammography: The Importance of the Question and the Critical Numbers Game Am. J. Roentgenol., November 1, 2007; 189(5): 1142 - 1144. [Full Text] [PDF] |
||||
![]() |
R. L. Ellis, A. A. Meade, M. A. Mathiason, K. M. Willison, and W. Logan-Young Evaluation of Computer-aided Detection Systems in the Detection of Small Invasive Breast Carcinoma Radiology, October 1, 2007; 245(1): 88 - 94. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Skaane, A. Kshirsagar, S. Stapleton, K. Young, and R. A. Castellino Effect of Computer-Aided Detection on Independent Double Reading of Paired Screen-Film and Full-Field Digital Screening Mammograms Am. J. Roentgenol., February 1, 2007; 188(2): 377 - 384. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. H. Taplin, C. M. Rutter, and C. D. Lehman Testing the effect of computer-assisted detection on interpretive performance in screening mammography. Am. J. Roentgenol., December 1, 2006; 187(6): 1475 - 1482. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Ko, M. J. Nicholas, J. B. Mendel, and P. J. Slanetz Prospective assessment of computer-aided detection in interpretation of screening mammography. Am. J. Roentgenol., December 1, 2006; 187(6): 1483 - 1491. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. R. Pai, N. E. Gregory, A. E. Swinford, and M. Rebner Ductal Carcinoma in Situ: Computer-aided Detection in Screening Mammography Radiology, December 1, 2006; 241(3): 689 - 694. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. J. Gilbert, S. M. Astley, M. A. McGee, M. G. C. Gillan, C. R. M. Boggis, P. M. Griffiths, and S. W. Duffy Single Reading with Computer-aided Detection and Double Reading of Screening Mammograms in the United Kingdom National Breast Screening Program Radiology, October 1, 2006; 241(1): 47 - 53. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. S. Dua, C. M. Isacke, and G. P.H. Gui The Intraductal Approach to Breast Cancer Biomarker Discovery J. Clin. Oncol., March 1, 2006; 24(7): 1209 - 1216. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Mouchawar, S. Taplin, L. Ichikawa, W. E. Barlow, A. M. Geiger, S. Weinmann, J. Gilbert, M. M. Manos, and M. U. Yood Late-Stage Breast Cancer Among Women With Recent Negative Screening Mammography: Do Clinical Encounters Offer Opportunity for Earlier Detection? J Natl Cancer Inst Monographs, November 1, 2005; 2005(35): 39 - 46. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Hirai, Y. Korogi, H. Arimura, S. Katsuragawa, M. Kitajima, M. Yamura, Y. Yamashita, and K. Doi Intracranial Aneurysms at MR Angiography: Effect of Computer-aided Diagnosis on Radiologists' Detection Performance Radiology, November 1, 2005; 237(2): 605 - 610. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. V. Ghate, M. S. Soo, J. A. Baker, R. Walsh, E. I. Gimenez, and E. L. Rosen Comparison of Recall and Cancer Detection Rates for Immediate versus Batch Interpretation of Screening Mammograms Radiology, April 1, 2005; 235(1): 31 - 35. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |