Radiology
DOI: 10.1148/radiol.2411051092
(Radiology 2006;241:47-53.)
© RSNA, 2006


Breast Imaging

Single Reading with Computer-aided Detection and Double Reading of Screening Mammograms in the United Kingdom National Breast Screening Program1

Fiona J. Gilbert, FRCR, Susan M. Astley, PhD, Magnus A. McGee, BSc, Maureen G. C. Gillan, PhD, Caroline R. M. Boggis, FRCR, Pamela M. Griffiths, BA and Stephen W. Duffy, MSc

1 From the Department of Radiology, University of Aberdeen, Lilian Sutton Bldg, Foresterhill, Aberdeen, Scotland, AB25 2ZD (F.J.G., M.G.C.G.); Department of Imaging Science and Biomedical Engineering, University of Manchester, Manchester, England (S.M.A., P.M.G.); Department of Public Health and General Practice, Christchurch School of Medicine, Christchurch, New Zealand (M.A.M.); Nightingale Center, Withington Hospital, Manchester, England (C.R.M.B.); and Department of Epidemiology, Mathematics and Statistics, Wolfson Institute of Preventive Medicine, London, England (S.W.D.). Received June 29, 2005; revision requested August 25; revision received October 5; accepted October 19; final version accepted February 10, 2006. Supported by Cancer Research UK and the UK NHS Breast Screening Program. Address correspondence to F.J.G. (e-mail: f.j.gilbert@abdn.ac.uk).


    ABSTRACT
Purpose: To retrospectively determine if the use of a computer-aided detection (CAD) system can improve the performance of single reading of screening mammograms to match that of double reading in the United Kingdom.

Materials and Methods: Local research ethics committee approval was obtained; informed consent was not required. This study included a sample of 10 267 mammograms obtained in women aged 50 years or older who underwent routine screening at one of two breast screening centers in 1996. Mammograms that were double read in 1996 were randomly allocated to be re-read by eight different radiologists using CAD. The cancer detection and recall rates from double reading and single reading with CAD were compared. Statistical significance and confidence intervals were calculated with the McNemar test to account for the matched nature of the data.

Results: Single reading with CAD led to a cancer detection rate that was significantly (P = .02) higher than that achieved with double reading: The detection rate was 6.5 percentage points higher with single reading with CAD than with double reading (49.1% vs 42.6%, respectively). However, the recall rate was also higher for single reading with CAD than for double reading (8.6% vs 6.5%, respectively; P < .001). This was equivalent to relative increases of 15% and 32% in the cancer detection and recall rates, respectively.

Conclusion: Single reading with CAD leads to an improved cancer detection rate and an increased recall rate.



    INTRODUCTION
In the United Kingdom, breast cancer screening is performed as part of a nationwide screening program organized by the government-funded National Health Service Breast Screening Program (NHSBSP) (1–3). This program offers free mammography at 3-year intervals to women aged 50–69 years.

The NHSBSP has recently been extended: The upper inclusion age has been increased from 64 years to 69 years, and two images (oblique and craniocaudal views) of each breast are now obtained at every examination. When the program was started in 1988, two views of each breast were obtained at the first (prevalent) screening session, and single views were obtained thereafter. The extension has increased the workload of readers (4), and additional radiologists and radiographers are needed to support and sustain these changes (5). Originally, mammograms were read by one radiologist (hereafter, single reading). However, in most centers, mammograms are now read independently by two radiologists (hereafter, double reading), as cancer detection rates are 5%–15% higher with double reading than with single reading (6–11).

A computer-aided detection (CAD) system that uses prompts to attract reader attention to suspicious features on mammograms (12–14) could conceivably improve the performance achieved with single reading so that it matches the performance achieved with double reading. The potential of CAD to enable detection of additional cancers and detection of cancers at an earlier stage than they would be detected with single reading has been demonstrated by the findings of several retrospective studies (15–18). However, evidence of a benefit from the use of CAD in a prospective screening setting is both limited and conflicting. Freer and Ulissey (12) reported a 19.5% increase in the cancer detection rate and a parallel 18.5% increase (from 6.5% to 7.7%) in the recall rate when they used CAD to assess 12 860 mammograms. In contrast, in a time series study of 115 571 mammograms, no difference in cancer detection or recall rates was reported after CAD was introduced (19). Recall rates for 14 817 mammograms after the introduction of CAD were compared with historic data for 23 682 mammograms; the introduction of CAD did not significantly affect the recall rate (16).

It is not known if the sensitivity achieved with single reading with CAD is comparable to that achieved with double reading. The findings of retrospective reviews in which prior mammograms from cancer cases were used suggest that single reading with CAD could yield the same performance as double reading, provided all correct prompts were recalled (18). On the other hand, the performance of double reading of mammograms was better in a simulated setting in which the performance data of individual readers were compared (20). Two other studies (21,22) revealed similar sensitivity and specificity between a single reading and a simulated double reading; however, these studies were not conducted in a screening environment with a large number of normal cases. This limits the possibilities for extrapolating the results to those in a screening setting in which only one in 100–200 mammograms shows cancer.

CAD systems generally have high sensitivity but only moderate specificity. Large numbers of prompts are generated; thus, the reader must decide which prompts require action and which should be dismissed. This could reduce the effectiveness of prompting when CAD is used as part of a routine screening program in which most of the mammograms are normal. The reader must learn to correctly dismiss the prompts that mark benign lesions or normal tissue without dismissing the prompts that mark cancers.

A prospective randomized trial would yield the most information pertaining to the role of CAD in breast screening. However, in the United Kingdom, it is first necessary to demonstrate that the performance of single reading with CAD is no worse than the performance of double reading, which is the current standard practice. This ensures that women will not be disadvantaged by the use of CAD. The Computer Aided Detection Evaluation Trial, or CADET, was therefore established to compare double reading with single reading with CAD. Thus, the purpose of our study was to retrospectively determine whether CAD can improve the performance of single reading to the level achieved with double reading in the United Kingdom.


    MATERIALS AND METHODS
Two ImageChecker M1000, version 5.0, CAD systems were loaned to us by R2 Technology (Sunnyvale, Calif) for a 2-year period. No other support was given, and the authors had full control of the study data and information submitted for publication.

Case Selection
This trial was designed to compare the performances (cancer detection rate and recall rate) of single reading with CAD and double reading in a cohort of cases. Sufficient time needed to elapse between double reading and single reading with CAD so that interval cancers and subsequent screening-detected cancers could be identified. The Computer Aided Detection Evaluation Trial was an equivalence study in terms of sensitivity (Appendix); it was designed so that a 95% confidence interval (CI) on the difference between cancer detection rates would rule out the possibility that the performance of single reading with CAD was more than 10% worse than the performance of double reading. This required re-reading 14 000–15 000 mammograms that depicted approximately 210 cancers. All the cancers were retained, and almost 5000 mammograms from women with no subsequent diagnosis of breast cancer were discarded randomly. This yielded a data set of 10 267 mammograms, of which 236 (2.3%) depicted cancer and 10 031 were normal; this data set was more consistent with the case mix encountered in a routine screening setting than were the data sets of most studies.

Mammograms were sampled from all routine screening mammograms that were obtained in 1996 in women aged 50 years or older and were double read. Two NHSBSP screening centers (Northeast Scotland Breast Screening Center, Aberdeen, Scotland, and Nightingale Center) participated in this study. Cancers had histologic evidence of malignancy present at biopsy, or in rare cases, at cytology. Study cancers were classified into three groups: (a) those detected at screening in 1996 (screening-detected cancers), (b) those detected in the 3 years between scheduled screenings (interval cancers), and (c) those detected at screening in 1999 (subsequent screening-detected cancers). Cancers detected after screening in 1999 were identified and termed poststudy cancers. Screening-detected cancers were identified from records held by the central screening system and confirmed by the two centers. Interval cancers were identified by means of record linkage undertaken by local cancer registries; the two centers confirmed these cancers. Local research ethics committee approval was obtained, informed consent was not required, and all mammograms were anonymized.

Digitization and CAD Analysis
The anonymized mammograms were digitized, and the results of CAD image analysis were displayed on a flat-panel display screen as low-spatial-resolution images overlaid with markers that indicated areas of potential abnormalities. The image analysis algorithms generated prompts for masses (asterisks) and microcalcifications (triangles). Regions in which both a mass and a microcalcification were depicted were marked with a composite "malc" (ie, mass and calcification) marker. In all cases, the size of the prompt was related to the likelihood of cancer, as determined by the algorithms. The PeerView (R2 Technology) facility enabled readers to view annotated enlargements of regions that contained prompts.

Readers and CAD Training
Four readers from each center participated in the study. All eight readers (including F.J.G. and C.R.M.B.) met the quality assurance standards of the NHSBSP (23) and read an average of more than 5000 mammograms per year. Readers at the Northeast Scotland Breast Screening Center had 2–13 years of screening experience; at the Nightingale Center, readers had 2–15 years of screening experience. The readers who interpreted mammograms in 1996 met the same NHSBSP standards and had 1–6 years of experience at the Northeast Scotland Breast Screening Center and 7–8 years of experience at the Nightingale Center. Four of the eight readers in our study (including F.J.G. and C.R.M.B.) participated in double reading of mammograms in 1996. The study coordinator (M.A.M.) ensured that these readers did not read mammograms that they had read previously in 1996.

The four readers at each center underwent a 2-month training period that consisted of an initial training session taught by representatives of R2 Technology; this was followed by consolidation and practice sessions in which six training sets of 75–100 cases were used. Training sets were completely separate from cases used in the study. After each session, readers were given access to truth data that allowed them to assess and improve their performance with CAD. In the initial sets, 25% of mammograms showed cancer. This percentage was progressively reduced to 5% to train the readers to dismiss prompts on normal mammograms in an environment with a low cancer rate, such as the screening setting (24).

Reading Procedure
Screening mammograms were randomly allocated to be read by a radiologist who had not been recorded as the first or second reader in 1996. Each mammogram was first viewed by the reader, and abnormalities (if any) were recorded on a pro forma data sheet, along with a recommendation to recall the patient for further assessment or for the patient to return in 3 years for routine screening. Prior mammograms were hung if they had been used at the time of the original double reading. The position and type of abnormality were marked, and the degree of suspicion was scored on a five-point scale (1, normal or benign; 2, probably benign; 3, indeterminate; 4, suspicious; and 5, malignant). The reader then accessed the prompt image and reviewed the mammograms to further examine any areas with CAD prompts. Any additional findings, along with another score and a recommendation for future imaging, were recorded. Readers were aware that this reading procedure differed from a routine screening procedure in that the case mix contained a higher proportion of cancer cases and the recommendation for recall would be recorded but no action would be taken.

The reading procedures that were used in 1996 at each screening center were replicated for individual readers using CAD. In one center, a patient was recalled if either reader recommended that she be recalled. In our study, each reader decided whether to recall a patient. In the other center, mammograms were scored with a five-point scale. Patients were recalled if either reader assigned a score of 3 or higher to a mammogram. If both readers assigned a score of 1 to a mammogram, the patient was not recalled. If a score of 2 was assigned by either reader, the case was discussed by the readers involved or with another reader, and they decided whether to recall the patient. In our study, cases with a score of 2 were discussed with another reader to determine whether to recall the patient.
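The two recall rules described above can be summarized in a short sketch. The function names and the returned labels ("recall", "discuss", "routine") are hypothetical and chosen only for illustration; the thresholds are the ones stated in the text.

```python
def recall_without_scores(reader1_recalls: bool, reader2_recalls: bool) -> bool:
    """Center without scoring: recall if either reader recommends recall."""
    return reader1_recalls or reader2_recalls


def recall_with_scores(score1: int, score2: int) -> str:
    """Center using the five-point scale (1 = normal or benign, 5 = malignant).

    Returns "recall" if either score is 3 or higher, "discuss" if the
    highest score is 2, and "routine" if both readers assigned a score of 1.
    """
    for score in (score1, score2):
        if score not in (1, 2, 3, 4, 5):
            raise ValueError(f"score must be 1-5, got {score}")
    highest = max(score1, score2)
    if highest >= 3:
        return "recall"
    if highest == 2:
        return "discuss"  # outcome decided by discussion between readers
    return "routine"  # return in 3 years for routine screening
```

For single reading with CAD, the same logic would apply to one reader's decision or score, with score-2 cases passed to a second reader for discussion, as the text describes.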

Statistical Analysis
The primary outcome measures were cancer detection rate and recall rate. We also compared the overall recall rate and the recall rate of normal cases (false-positive findings) for a single reader using CAD with the recall rate of the original two readers. The detection rate of a single reader using CAD was calculated by using the screening-detected cancers, the interval cancers, and the subsequent screening-detected cancers. Poststudy cancers were not included in the first analysis; a sensitivity analysis that included these cancers was performed thereafter. Statistical significance and CIs were calculated with the McNemar test to take into account the matched nature of the data (25). Stata statistical software (version 8.0; Stata, College Station, Tex) was used (26).


    RESULTS
Original recall details were retrieved from the database and relevant mammograms were successfully digitized and single read with CAD for 10 096 (98.3%) of the original 10 267 mammograms identified, including 230 (97.5%) of the 236 study cancers (Figure 1, Table 1). The age distribution for the study population with full history was as follows: 3461 (34%) patients were aged 50–54 years, 2860 (28%) were aged 55–59 years, 3397 (34%) were aged 60–64 years, and 378 (4%) were aged 65 years or older. Of the original 10 267 mammograms, 3515 (34%) were obtained in patients aged 50–54 years, 2912 (28%) were obtained in patients aged 55–59 years, 3455 (34%) were obtained in patients aged 60–64 years, and 385 (4%) were obtained in patients aged 65 years or older.


Figure 1. Flow chart shows outcomes of study mammograms.

Table 1. Tumor Features

Single reading with CAD enabled us to detect 49.1% of cancer cases, whereas only 42.6% of cancer cases were detected with double reading (Table 2). The mean difference between the detection rates for cancer cases was 6.5% (95% CI: 1.1%, 11.9%; P = .02) (Table 3). Overall recall rates were 8.6% for single reading with CAD and 6.5% for double reading (P < .001). This was equivalent to relative increases of 15% and 32% in cancer detection and recall rates, respectively. The results were similar at both centers.
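The relative increases quoted above follow directly from the reported rates. The sketch below only reproduces that arithmetic; the helper name `relative_increase` is an illustrative assumption, and the inputs are the rounded percentages given in the text.

```python
def relative_increase(new_rate: float, old_rate: float) -> float:
    """Relative change of new_rate with respect to old_rate."""
    return (new_rate - old_rate) / old_rate


# Rates in percent, as reported: single reading with CAD vs double reading.
detection_increase = relative_increase(49.1, 42.6)
recall_increase = relative_increase(8.6, 6.5)

print(f"{detection_increase:.1%}")  # 15.3%
print(f"{recall_increase:.1%}")     # 32.3%
```

The absolute difference in detection (6.5 percentage points) and the relative increase (about 15%) describe the same result on different scales.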


Table 2. Recall Rates

Table 3. Recall Rates for Cancer Cases

When considering only cancer cases, there was 85% agreement (195 of 230 cases) between the two reading techniques (Table 3). Of the 35 cases in which there was disagreement, 10 were recalled at double reading but not at single reading with CAD and 25 were recalled at single reading with CAD but not at double reading. For normal cases (Table 4), the recall rate was significantly higher for single reading with CAD than for double reading (7.7% vs 5.7%, respectively; P < .001), despite the small average difference in specificity of 2.0% (95% CI: 1.3%, 2.7%). For normal cases, there was 91% agreement between the two reading techniques.
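The discordant counts for cancer cases (25 recalled only with CAD, 10 recalled only at double reading, 230 pairs in total) are enough to sketch a matched-pairs analysis. The code below is a minimal illustration using an exact two-sided McNemar test and a Wald-type interval for the paired difference; these are assumptions about method, not a reconstruction of the Stata procedure the authors used, so the interval need not match the published 1.1%–11.9% exactly.

```python
from math import comb, sqrt


def mcnemar_exact_p(b: int, c: int) -> float:
    """Two-sided exact McNemar P value from the two discordant counts."""
    n = b + c
    tail = sum(comb(n, k) for k in range(min(b, c) + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def paired_difference_ci(b: int, c: int, n_pairs: int, z: float = 1.96):
    """Difference in paired proportions with a Wald-type 95% interval."""
    diff = (b - c) / n_pairs
    se = sqrt(b + c - (b - c) ** 2 / n_pairs) / n_pairs
    return diff, diff - z * se, diff + z * se


cad_only, double_only, cancers = 25, 10, 230
p_value = mcnemar_exact_p(cad_only, double_only)
diff, low, high = paired_difference_ci(cad_only, double_only, cancers)

print(f"difference = {diff:.1%}, P = {p_value:.3f}")
```

With these counts the difference is 15 of 230 cases (6.5%), and the exact P value rounds to .02, consistent with the reported result.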


Table 4. Recall Rates for Normal Cases

Table 5 shows the matched-pair comparison for all cancers diagnosed between 1996 and 2003, including interval cancers diagnosed after screening in 1999. There was an 84% agreement rate between double reading and single reading with CAD. The rate of detection of these cancers was significantly (P = .001) higher with single reading with CAD than with double reading (40.0% vs 32.7%, respectively). The difference between detection rates was 7.3% (95% CI: 2.6%, 12.0%).


Table 5. Cancers Diagnosed between 1996 and 2003


    DISCUSSION
A 6.5% difference in the detection rate of tumors in favor of single reading with CAD was observed; the lower point of the 95% CI was 1.1%. This indicates that, in terms of sensitivity, the performance of single reading with CAD was better than that of double reading (albeit with a higher recall rate).

The recall rate was 8.6% for single reading with CAD and 6.5% for double reading. In effect, this difference was due to reduced specificity for single reading with CAD. Recall rates in the United Kingdom in 1996 and 1997 were 6.7% in Scotland and 4.9% in England. In 2002 and 2003, the recall rate was 6.1% in Scotland and 5.2% in England. The observed increase in the recall rate for single reading with CAD is not excessive when one considers that cancer detection rates in the United Kingdom and the United States are similar but that the recall rate in the United Kingdom is approximately half that in the United States (1). Published reports of single reading with CAD in a prospective setting indicate that there is an increase in recall rate (12) or no significant change (16,19).

We believe the strength of our study was its large sample size that included almost all subsequent screening-detected cancers and interval cancers. This allowed the pathologic finding–based reference standard to be used to determine which cases were cancers. In our study, the ratio of cancer cases to normal cases was closer to the ratio in a screening situation than the ratio in many previous retrospective studies (9,16,20,21,27); this resulted in a better indication of how the reader might behave in a prospective setting when he or she would need to ignore a large number of false prompts on normal screening mammograms.

Most evaluations of CAD have been conducted in the United States, where the screening population and program differ considerably from those in the United Kingdom: The U.S. setting has higher recall rates, different age ranges and screening intervals, and variable ascertainment of interval cancers (1,3). The success of CAD in a screening program is highly dependent on the specificity of the prompts and the effect that these prompts have on reader behavior and performance. A relatively large number of false prompts may lead to reader fatigue and reduced performance. Readers may begin to ignore both true and false prompts in a situation in which the cancer rate is low, such as routine screening. Readers also need to avoid becoming too reliant on CAD prompting or being falsely reassured of the absence of cancer if there are no CAD prompts (28). If we accept that the cancer detection rate is highest for double reading with arbitration by a third reader (6), it may be that use of arbitration for equivocal cases read with CAD would permit a high sensitivity to be achieved, without compromising specificity, and recall rates with CAD could be kept at a level acceptable to the NHSBSP.

A limitation of our study was that approximately 70% of the cases were single-view mammograms. This does not reflect current practice, in which two-view mammography is the standard in the majority of screening programs. It has been shown that CAD accuracy is less for single-view mammography than for two-view mammography; however, this should not have affected the overall results of our study.

Further limitations of our study were its retrospective design and the difference in experience levels between the readers in this study and those in the original reading exercise. Single reading with CAD was performed in 2003 and 2004, whereas the original double reading was performed in 1996. Although the range of experience of the eight readers in our study was comparable to that of the readers who read mammograms in 1996, reader performance may have improved during this period, and this could be partially responsible for the better performance observed here for single reading with CAD. Standardized detection rates (ie, the ratio of the number of invasive cancers detected to the number of invasive cancers expected in the age distribution of the population) in the United Kingdom in 2003 were 1.35 at first (prevalent) screening and 1.18 at subsequent screening (29); corresponding figures in 1996 and 1997 were 1.17 and 0.94, respectively. This suggests a 15% relative improvement in cancer detection for prevalent screening and a 26% improvement for subsequent screening. Improvements in image quality, the increasing use of two views (30–32), and higher background cancer incidence (33) probably account for most of the observed increase in the cancer detection rate. There may have been a modest improvement in reader performance, but this was likely to have been less than the 15% improvement (from 42.6% to 49.1%) observed for single reading with CAD. Thus, the observed improvement may correspond to a true equivalence of the two detection regimens rather than to an advantage for single reading with CAD. It remains unlikely, however, that single reading with CAD is actually inferior to double reading.
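The 15% and 26% figures for the national standardized detection rates can be checked with a few lines of arithmetic. The dictionary names below are illustrative assumptions; the values are those quoted in the paragraph above.

```python
# Standardized detection rates quoted in the text.
sdr_2003 = {"prevalent": 1.35, "subsequent": 1.18}
sdr_1996_1997 = {"prevalent": 1.17, "subsequent": 0.94}

# Relative improvement from 1996-1997 to 2003 for each screening type.
improvement = {
    kind: sdr_2003[kind] / sdr_1996_1997[kind] - 1
    for kind in sdr_2003
}

for kind, value in improvement.items():
    print(f"{kind}: {value:.0%}")
```

The prevalent-round figure is of the same size as the 15% relative gain observed for single reading with CAD, which is why the authors treat secular improvement as a possible partial explanation.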

In addition, re-reading differed from routine screening in that the readers were aware of the cancer-enriched case mix and that their decisions would not have any clinical implications. Both factors could have led to an increase in the recall rate that could have contributed to an increase in the cancer detection rate. Readers are aware of the adverse psychological consequences of recall; therefore, they try to keep recall to a minimum, without missing cancers. However, in our study, readers knew that the recall would not actually happen, and knowing this may have caused them to increase their recall rate subconsciously.

In conclusion, our results show that the performance of single reading with CAD is equivalent to the performance of double reading. There was a slight increase in the recall rate that may have been caused by the additional cancers in the study population. Double reading is the normal practice in the NHSBSP; however, it would now be acceptable and ethical to undertake a randomized controlled trial to determine whether diagnostic performance is maintained with CAD in a prospective setting.


    APPENDIX: SAMPLE SIZE CALCULATION
The proposed sample size was estimated with a likely cancer detection rate of 6.39 cases per 1000 for screening-detected cancers, 2.90 cases per 1000 for interval cancers, and 5.60 cases per 1000 for subsequent screening-detected cancers. In a review of incident round cancers, 19% were false-negative findings from previous screening sessions, 28% showed minimal signs of cancer, and 53% were new tumors (34). Since we do not know what might have been detected if CAD had been used in 1996, the three cancer detection rates were included in the incident detection rate. Adding the cancer detection rates for screening-detected cancers, interval cancers, and subsequent screening-detected cancers yields a total of 14.89 cancers per 1000 mammograms to be re-read. Assuming 80% agreement between double reading and single reading with CAD, the null situation of equal sensitivity (although not invariably to the same tumors) yields the percentages in Table A1. If we assume equivalence of sensitivity to be no more than an absolute difference of 10% in detection rates, this would require a trial large enough for the CI to rule out the percentages in Table A2.


Table A1. Expected Results for Absolute Equivalence

Table A2. Difference between Methods Designed to be Ruled out with 95% CI

This would require 42 disagreements between the strategies (ie, 210 cancers and 14 104 mammograms) (35). We initially intended to include 15 000 mammograms in our study, but we were assigned resources to re-read only 10 000 mammograms. Thus, we planned to re-read the normal mammograms from the group of 10 000 but to include all the cancers (n = 236) from the group of 15 000, thereby enriching the case mix so that cancers made up 2.3% of the mammograms. Radiologists performing the re-reading were aware of the enriched case mix.
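The link between the 42 disagreements, the 210 cancers, and the 14 104 mammograms can be retraced from the figures in this appendix. The sketch below is a consistency check only: the requirement of 42 disagreements is taken from the text as given (it is not derived here), and the variable names are illustrative.

```python
import math

# Expected detection rates per 1000 mammograms, as stated in the appendix.
rates_per_1000 = {
    "screening_detected": 6.39,
    "interval": 2.90,
    "subsequent_screening_detected": 5.60,
}
total_rate = sum(rates_per_1000.values())  # cancers per 1000 mammograms

assumed_agreement = 0.80
required_disagreements = 42  # taken from the text, not derived here

# With 80% agreement, 20% of cancers are expected to be discordant.
cancers_needed = round(required_disagreements / (1 - assumed_agreement))

# Mammograms needed to yield that many cancers at the expected rate.
mammograms_needed = math.ceil(cancers_needed / (total_rate / 1000))

print(round(total_rate, 2), cancers_needed, mammograms_needed)
# prints: 14.89 210 14104
```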


    ADVANCE IN KNOWLEDGE


    ACKNOWLEDGMENTS
 
The authors thank Heather E. Deans, FRCR; Karen A. Duncan, FRCR; Geeta V. Iyengar, FRCR; and Elspeth L. Singleton of the Northeast Scotland Breast Screening Center and Nicola B. Barr, MRCS; Ursula M. Beetles, FRCR; M.A. Griffiths, DCR; Anil K. Jain, FRCR; Jill Johnson, HDCR; Rita F. Roberts, DCR; and Mary Wilson, FRCR, of the Nightingale Center for assistance in data collection. We also thank the Scottish Information and Statistics Division, the Scottish Cancer Registry, the Manchester Cancer Registry, and the staff of the Northeast Scotland Breast Screening Center and the Nightingale Center for their assistance.


    FOOTNOTES
 

Abbreviations: CAD = computer-aided detection • CI = confidence interval • NHSBSP = National Health Service Breast Screening Program

Authors stated no financial relationship to disclose.

Author contributions: Guarantor of integrity of entire study, F.J.G.; study concepts/study design or data acquisition or data analysis/interpretation, all authors; manuscript drafting or manuscript revision for important intellectual content, all authors; manuscript final version approval, all authors; literature research, F.J.G., S.M.A., M.G.C.G., C.R.M.B.; clinical studies, F.J.G., M.A.M., C.R.M.B.; experimental studies, M.A.M.; statistical analysis, M.A.M., P.M.G., S.W.D.; and manuscript editing, all authors


    References

  1. Smith-Bindman R, Chu PW, Miglioretti DL, et al. Comparison of screening mammography in the United States and the United Kingdom. JAMA 2003;290:2129–2137.[Abstract/Free Full Text]
  2. Blanks RG, Moss S, Patnick JP. Results from the United Kingdom NHS breast screening programme 1994–1999. J Med Screen 2000;7:195–198.[Abstract/Free Full Text]
  3. Shapiro S, Coleman EA, Broeders M, et al. Breast cancer screening programmes in 22 countries: current policies, administration and guidelines. Int J Epidemiol 1998;27:735–742.[Abstract/Free Full Text]
  4. Field S. Breast screening issues: a report into workforce, funding, litigation and morale. Newsl R Coll Radiol 1998;54:12.
  5. Department of Health. The NHS Cancer Plan. London, England: Her Majesty's Stationery Office, 2000.
  6. Blanks RG, Wallis MG, Moss SM. A comparison of cancer detection rates achieved by breast cancer screening programmes by number of readers, for one and two view mammography: results from the UK National Health Service breast screening programme. J Med Screen 1998;5:195–201.[Abstract/Free Full Text]
  7. Deans HE, Everington D, Cordiner C, et al. Scottish experience of double reading in the National Breast Screening Programme. Breast 1998;7:75–79.
  8. Anttinen I, Pamilo M, Soiva M, Roiha M. Double reading of mammography screening films: one radiologist or two? Clin Radiol 1993;48:414–421.[CrossRef][Medline]
  9. Thurfjell E, Thurfjell MG, Egge E, Bjurstam N. Sensitivity and specificity of computer-assisted breast cancer detection in mammography screening. Acta Radiol 1998;39:384–388.[Medline]
  10. Destounis SV, DiNitto P, Logan-Young W, Bonaccio E, Zuley ML, Willison KM. Can computer-aided detection with double reading of screening mammograms help decrease the false-negative rate? initial experience. Radiology 2004;232:578–584.[Abstract/Free Full Text]
  11. Harvey SC, Geller B, Oppenheimer RG, Pinet M, Riddell L, Garra B. Increase in cancer detection and recall rates with independent double interpretation of screening mammography. AJR Am J Roentgenol 2003;180:1461–1467.[Abstract/Free Full Text]
  12. Freer TW, Ulissey MJ. Screening mammography with computer-aided detection: prospective study of 12 860 patients in a community breast center. Radiology 2001;220:781–786.[Abstract/Free Full Text]
  13. Savage CJ, Gale AG, Pawly EF, Wilson ARM. To err is human, to compute divine? In: Gale AG, Astely SM, Dance DR, Cairns AY, eds. Digital mammography. Amsterdam, the Netherlands: Elsevier, 1994; 405–414.
  14. Boggis CR, Astley SM. Computer-assisted mammographic imaging. Breast Cancer Res 2000;2:392–395.
  15. te Brake GM, Karssemeijer N, Hendriks JH. Automated detection of breast carcinomas not detected in a screening program. Radiology 1998;207:465–471.
  16. Warren Burhenne LJ, Wood SA, D'Orsi CJ, et al. Potential contribution of computer-aided detection to the sensitivity of screening mammography. Radiology 2000;215:554–562.
  17. Birdwell RL, Ikeda DM, O'Shaughnessy KF, Sickles EA. Mammographic characteristics of 115 missed cancers later detected with screening mammography and the potential utility of computer-aided detection. Radiology 2001;219:192–202.
  18. Brem RF, Baum J, Lechner M, et al. Improvement in sensitivity of screening mammography with computer-aided detection: a multiinstitutional trial. AJR Am J Roentgenol 2003;181:687–693.
  19. Gur D, Sumkin JH, Rockette HE, et al. Changes in breast cancer detection and mammography recall rates after the introduction of a computer-aided detection system. J Natl Cancer Inst 2004;96:185–190.
  20. Karssemeijer N, Otten JD, Verbeek AL, et al. Computer-aided detection versus independent double reading of masses on mammograms. Radiology 2003;227:192–200.
  21. Taylor P, Given-Wilson R, Champness J, Potts HW, Johnston K. Assessing the impact of CAD on the sensitivity and specificity of film readers. Clin Radiol 2004;59:1099–1105.
  22. Ciatto S, Del Turco MR, Risso G, et al. Comparison of standard reading and computer-aided detection (CAD) on a national proficiency test of screening mammography. Eur J Radiol 2003;45:135–138.
  23. National Health Service Breast Screening Programme. Quality assurance guidelines for breast cancer screening radiology. Publication no. 59. Sheffield, England: National Health Service Breast Screening Programme, 2005.
  24. Astley S, Quarterman C, Al Nuaimi Y, et al. Computer-aided detection in screening mammography: the impact of training on reader performance. In: Pisano E, ed. Proceedings of the Seventh International Workshop on Digital Mammography, Chapel Hill NC, June 18–21, 2004.
  25. McNemar Q. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 1947;12:153–157.
  26. Stata statistical software, version 8.0 [package insert]. College Station, Tex: Stata, 2003.
  27. Garvican L, Field S. A pilot evaluation of the R2 image checker system and users' response in the detection of interval breast cancers on previous screening films. Clin Radiol 2001;56:833–837.
  28. Krupinski EA. Computer-aided detection in clinical environment: benefits and challenges for radiologists. Radiology 2004;231:7–9.
  29. National Health Service Breast Screening Programme. Annual review. Publication no. 56. Sheffield, England: National Health Service Breast Screening Programme, 2003.
  30. Young KC, Ramsdale ML. Performance of mammography equipment in the UK breast screening programme in 2000/2001. Publication no. 56. Sheffield, England: National Health Service Breast Screening Programme, 2003.
  31. National Health Service Breast Screening Programme. Consolidated guidance on standards for the NHS breast screening programme. Publication no. 60 (version 2). Sheffield, England: National Health Service Breast Screening Programme, 2005.
  32. Blanks RG, Bennett RL, Patnick J, Cush S, Davison C. The effect of changing from one to two views at incident (subsequent) screens in the NHS breast screening programme in England: impact on cancer detection and recall rates. Clin Radiol 2005;60:674–680.
  33. UK breast cancer incidence statistics. Cancer Research UK Web site. http://info.cancerresearchuk.org/cancerstats/types/breast/incidence/. Accessed June 9, 2005.
  34. Duncan KA, Needham G, Gilbert FJ, Deans HE. Incident round cancers: what lessons can we learn? Clin Radiol 1998;53:29–32.
  35. Jones B, Jarvis P, Lewis JA, Ebbutt AF. Trials to assess equivalence: the importance of rigorous methods. BMJ 1996;313:36–39. [Published correction appears in BMJ 1996;313:550.]



This article has been cited by other articles:


F. J. Gilbert, S. M. Astley, M. G.C. Gillan, O. F. Agbaje, M. G. Wallis, J. James, C. R.M. Boggis, S. W. Duffy, and the CADET II Group
Single Reading with Computer-Aided Detection for Screening Mammography
N. Engl. J. Med., October 16, 2008; 359(16): 1675 - 1684.


M. Gromet
Comparison of Computer-Aided Detection to Double Reading of Screening Mammograms: Review of 231,221 Mammograms
Am. J. Roentgenol., April 1, 2008; 190(4): 854 - 859.


R. F. Brem
Blinded Comparison of Computer-Aided Detection with Human Second Reading in Screening Mammography: The Importance of the Question and the Critical Numbers Game
Am. J. Roentgenol., November 1, 2007; 189(5): 1142 - 1144.

