|
|
||||||||
Breast Imaging |
1 From the Departments of Radiology and Biomedical Engineering, the UNC-Lineberger Comprehensive Cancer Center, and the Biomedical Research Imaging Center of the University of North Carolina School of Medicine, 101 Manning Dr, Room 515, Old Infirmary Building, Chapel Hill, NC 27599-7510 (E.D.P.); Center for Statistical Sciences, Brown University, Providence, RI (S.A., J.B.C., L.A.H., C.A.G.); Department of Radiology, Feinberg School of Medicine, Northwestern University, Chicago, Ill (R.E.H.); Departments of Medical Imaging (M.J.Y., R.A.J.) and Medical Biophysics (M.J.Y.), University of Toronto, Toronto, Ontario, Canada; Department of Radiology, Beth Israel Deaconess Medical Center, Boston, Mass (J.K.B.); Department of Radiology, University of California-Los Angeles, Los Angeles, Calif (L.W.B.); Department of Radiology, University of Pennsylvania, Philadelphia, Pa (E.F.C.); Department of Radiology, University of Iowa, Iowa City, Iowa (L.L.F.); Department of Radiology, William Beaumont Hospital, Royal Oak, Mich (M.R.); Department of Radiology, Emory University, Atlanta, Ga (C.J.D.); and Center for the Evaluative Clinical Sciences, Norris Cotton Cancer Center, Dartmouth Medical School, Lebanon, NH (A.N.A.T.). The remaining members of the DMIST Investigators Group are listed in the Appendix. Received January 30, 2007; revision requested April 11; revision received May 23; final version accepted July 2. Address correspondence to E.D.P. (e-mail: etta_pisano{at}med.unc.edu).
| ABSTRACT |
|---|
|
|
|---|
Materials and Methods: DMIST included women who underwent both digital and film screening mammography. Institutional review board approval at all participating sites and informed consent from all participating women in compliance with HIPAA was obtained for DMIST and this retrospective analysis. Areas under the receiver operating characteristic curve (AUCs) for each modality were compared within each subgroup evaluated (age < 50 vs 50–64 vs
65 years, dense vs nondense breasts at mammography, and pre- or perimenopausal vs postmenopausal status for the two younger age cohorts [10 new subgroups in toto]) while controlling for multiple comparisons (P < .002 indicated a significant difference). All DMIST cancers were evaluated with respect to mammographic detection method (digital vs film vs both vs neither), mammographic lesion type (mass, calcifications, or other), digital machine type, mammographic and pathologic size and diagnosis, existence of prior mammographic study at time of interpretation, months since prior mammographic study, and compressed breast thickness.
Results: Thirty-three centers enrolled 49 528 women. Breast cancer status was determined for 42 760 women, the group included in this study. Pre- or perimenopausal women younger than 50 years who had dense breasts at film mammography comprised the only subgroup for which digital mammography was significantly better than film (AUCs, 0.79 vs 0.54; P = .0015). Breast Imaging Reporting and Data System–based sensitivity in this subgroup was 0.59 for digital and 0.27 for film mammography. AUCs were not significantly different in any of the other subgroups. For women aged 65 years or older with fatty breasts, the AUC showed a nonsignificant tendency toward film being better than digital mammography (AUCs, 0.88 vs 0.70; P = .0025).
Conclusion: Digital mammography performed significantly better than film for pre- and perimenopausal women younger than 50 years with dense breasts, but film tended nonsignificantly to perform better for women aged 65 years or older with fatty breasts.
© RSNA, 2008
| INTRODUCTION |
|---|
|
|
|---|
These results have puzzled some readers (3), who have inferred that a significant difference in performance for digital mammography in one subset of the population with no difference in performance overall must mean that film mammography outperformed digital for some other subset. Because the results were not anticipated, the originally planned analysis did not attempt to dissect the effect of the three different factors defining the groups for which digital mammography performed better—that is, age, menopausal status, and breast density.
Breast density is known to affect mammographic accuracy (4) and is considered to be the most likely driving factor for the DMIST results (5). Age is correlated with density, with younger women tending to have mammographically denser breasts. Both of these factors were well classified in DMIST, with density defined by using the four-point Breast Imaging Reporting and Data System (BI-RADS) scale in use at the time the study was open to accrual. Menopausal status was defined in DMIST through self-report by each subject. A woman reporting regular menstruation was considered premenopausal. Women whose last menstrual period was less than 1 year prior to the study mammography and who stated they were no longer having regular menstruation were considered perimenopausal. Women whose last menstrual period was more than 1 year prior to the study mammogram were considered postmenopausal. All women who had undergone hysterectomy were defined as postmenopausal, regardless of their ovarian status and the date of their last menstrual period.
To provide additional information from the DMIST study beyond the originally planned and reported subset analyses, our study retrospectively compared the accuracy of digital versus film mammography in subgroups defined by combinations of age, menopausal status, and breast density, by using either biopsy results or follow-up information as the reference standard.
| MATERIALS AND METHODS |
|---|
|
|
|---|
DMIST, and this retrospective new analysis, had institutional review board approval at the American College of Radiology Imaging Network and at all participating sites (Appendix), participant informed consent, and compliance with the Health Insurance Portability and Accountability Act. No direct industrial support was provided for DMIST or for this study. Digital mammography machines (the Computed Radiography System for Mammography; Fuji Medical, Tokyo, Japan) that were not approved for purchase by the U.S. Food and Drug Administration at the time of DMIST were provided at the expense of the manufacturers. Coauthors who are not paid consultants or employees of the digital mammography machine manufacturers had control of inclusion of the data and the information that is included in this manuscript. (L.L.F. is a salaried member of the Board of Directors of and C.J.D. is a paid consultant to Hologic [Bedford, Mass]. C.J.D. is a paid consultant to GE Medical Systems [Milwaukee, Wis]. J.K.B. was paid as a consultant to Fischer Medical [Denver, Colo] during DMIST.)
Current Study Reference Standard
The reference standard status of each participant was defined as positive for malignancy if there was evidence of pathologically verified cancer within 455 days after initial study mammography and as negative for malignancy if the participant's status was not classified as positive and if their breast cancer status was determined to be negative at the enrolling institution 10 months or more after study entry, either through follow-up mammography (including subsequent work-up) or with other information. Participants whose status was neither positive nor negative were classified as having indeterminate status if they had undergone a breast biopsy with indeterminate results (defined as insufficient or nondiagnostic interpretations); had undergone a follow-up mammographic study that was interpreted as showing BI-RADS category 3, 4, or 5 findings and had no additional follow-up information; or had died during the follow-up period without a diagnosis of breast cancer. All other participants were classified as having an unknown reference standard.
Final Study Group
Participants with either positive or negative reference-standard status comprised the set of "fully verified" patients whose cancer status had been determined and who were used in the analysis as the final study group.
New Retrospective Subgroup Analyses
We undertook a new retrospective analysis of the accuracy of digital and film mammography for new population subsets defined by combinations of age, menopausal status, and mammographic density. Specifically, we compared digital and film mammography in 10 subgroups of women: pre- and perimenopausal women younger than 50 years with fatty breasts, pre- and perimenopausal women younger than 50 years with dense breasts, postmenopausal women younger than 50 years with fatty breasts, postmenopausal women younger than 50 years with dense breasts, pre- and perimenopausal women between 50 and 64 years of age with fatty breasts, pre- and perimenopausal women between 50 and 64 years of age with dense breasts, postmenopausal women between 50 and 64 years of age with fatty breasts, postmenopausal women between 50 and 64 years of age with dense breasts, women aged 65 years or older with fatty breasts, and women aged 65 years or older with dense breasts. Menopausal status was eliminated as a factor for women aged 65 years or older because there were so few pre- and perimenopausal women in this age group, and there is high likelihood that such women were misclassified. We chose to subdivide the group of women older than 50 years into two subgroups because of questions about the relative utility of the two mammographic technologies in the U.S. Medicare population, questions that are particularly relevant to the DMIST cost-effectiveness analysis, whose results will be reported elsewhere. All of these analyses were based on the original DMIST data, with interpretations performed by the on-site study radiologists.
In addition, we evaluated all cancers with respect to mammographic detection method (digital vs film vs both vs neither), mammographic lesion type (mass, calcifications, or other), digital machine type, mammographic and pathologic size and diagnosis, existence of prior mammographic study at the time of interpretation, months since prior mammographic study, and compressed breast thickness. Digital machine types included in DMIST were the SenoScan (Fischer Medical), the Computed Radiography System for Mammography (Fuji Medical), the Senographe 2000D (GE Medical Systems), the Digital Mammography System (Hologic), and the Selenia Full Field Digital Mammography System (Hologic). Mammographic lesion type (mass, calcifications, or other, with masses plus calcifications classified as masses) and size of breast cancers were recorded by a single radiologist (E.D.P., with 23 years of experience in interpreting breast imaging studies), who reviewed film and digital mammograms of all studies known to contain cancer side by side, with knowledge of lesion location and diagnosis after the primary DMIST study was completed. Mammographic size was measured in millimeters by using the digital and film mammogram that best depicted the lesion according to the judgment of the single reader. Pathologic size was obtained from pathology reports acquired from the clinical sites. Compressed breast thickness was obtained from the Digital Imaging and Communications in Medicine headers of the digital mammograms.
Statistical Analysis
Receiver operating characteristic curves for digital and film mammography were estimated from the pooled data across the study by using the seven-point malignancy score assigned to each patient at the time of screening mammography and before further work-up (1). The AUCs were compared by using the bivariate binormal model, which accounts for the paired test design (6,7). A corroborating nonparametric AUC analysis was also performed (8,9). Although the analysis for our current study was exploratory and the subsets were not planned in the original study, we controlled for multiple comparisons by using a Bonferroni correction, and we consider the AUC subset comparisons reported as additional comparisons to the 15 reported in the primary DMIST study. We required a P value of less than .002 (.05/25) to declare significance of differences in our current study.
In addition to the exploration of the AUC comparisons, estimates of the sensitivity, specificity, and positive predictive value (the so-called PPV1, henceforth referred to as PPV) of the two mammographic modalities were computed on the basis of the BI-RADS scores assigned at initial screening interpretation dichotomized into negative (BI-RADS scores of 1, 2, and 3) and positive (BI-RADS scores of 0, 4, and 5) scores. Comparisons of estimates were performed by using the McNemar test. Additional work-up status was used to classify the breast cancer status of subjects in some parts of the analysis. A participant's status was classified as positive for additional work-up if a BI-RADS score of 0, 4, or 5 had been assigned during the initial digital or film mammographic study or if the initial screening test resulted in a recommendation for further imaging (additional mammographic views, ultrasonography, magnetic resonance imaging), physical examination, or biopsy. In accordance with the approach taken in the primary DMIST study, the comparisons of sensitivity, specificity, and PPV were treated as descriptive in the analysis.
Statistical software (SAS, version 9.1, SAS Institute, Cary, NC; and ROCKIT, version 0.9 beta [available from the Kurt Rossmann Laboratories for Radiologic Image Research at the University of Chicago, Chicago, Ill, at http://www-radiology.uchicago.edu/krl/index.htm]) was used in the statistical analysis.
| RESULTS |
|---|
|
|
|---|
|
|
|
|
Also of interest is the presence of ductal carcinoma in situ and mammographically smaller lesions throughout both age groups. There were also no obvious trends regarding prior studies for comparison, in terms of either their availability or the length of time since the earlier study. In addition, we found no difference in compressed breast thickness that would explain the differences in performance between digital and film mammography in the two age groups.
| DISCUSSION |
|---|
|
|
|---|
We evaluated the possible causes for these trends, but could not identify a definitive cause on the basis of the original DMIST data. Because digital mammography has improved image contrast with worse spatial resolution compared with film, we evaluated the types of lesions found with both modalities in each subset of the population. We saw no difference in the mammographic lesion size and the presence of ductal carcinoma in situ that could explain the detection difference between the two populations, suggesting that differences in spatial resolution are not responsible for the trend seen between younger and older women. The only factor that seemed to be correlated with the poorer performance of digital mammography in older women was the higher percentage of cancers missed when digital screening was performed by using the Fischer unit compared with the other machine types. Very few cancers were detected with each machine, however, so the importance of this factor is difficult to determine with certainty. Of course, DMIST was not designed to answer this specific question, and it is not surprising that the study power was insufficient for us to answer definitively many interesting questions that arise from the results of our primary DMIST analysis.
Another possible explanation for the source of variation in diagnostic accuracy between digital and film mammography found in DMIST is the variability of the interpretive performance of the readers who participated in the study. This factor was mitigated as much as possible by the study design, in that all readers read approximately equal numbers of digital and film studies.
We believe our study provides additional information for researchers contemplating further research studies of digital mammography. However, it does not provide a definitive answer to the interesting question of why digital mammography performed better than film mammography for women with dense breasts, women younger than 50 years, and pre- and perimenopausal women and why there was a tendency toward better performance for film mammography for women aged 65 years or older with fatty breasts. The results of these exploratory analyses are presented as hypothesis generating. They do not supplant the previously published results (2) but rather serve to provide more detailed information and, ultimately, contribute to greater understanding of the factors that may affect the comparison of digital and screen-film mammography.
To address some of the remaining questions, we are currently performing a study in which highly experienced readers will compare digital and film mammograms for all DMIST cancers. Perhaps the image processing used for dense and fatty breasts causes a difference in performance of digital versus film for the two populations that is most apparent when the population is highly skewed to women with either very dense or very fatty breasts, as it would be in the youngest and oldest age groups studied. We hope such a review will help determine whether image characteristics themselves may explain the differences in diagnostic accuracies for these two distinct populations of women.
| APPENDIX |
|---|
|
|
|---|
|
| ADVANCES IN KNOWLEDGE |
|---|
|
|
|---|
| IMPLICATION FOR PATIENT CARE |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Abbreviations: AUC = area under the receiver operating characteristic curve BI-RADS = Breast Imaging Reporting and Data System DMIST = Digital Mammographic Imaging Screening Trial PPV = positive predictive value
Guarantors of integrity of entire study, E.D.P., C.A.G.; study concepts/study design or data acquisition or data analysis/interpretation, all authors; manuscript drafting or manuscript revision for important intellectual content, all authors; manuscript final version approval, all authors; literature research, R.E.H., M.J.Y., C.A.G.; clinical studies, R.E.H., M.J.Y., J.K.B., E.F.C., L.L.F., L.W.B., C.J.D., R.A.J.; statistical analysis, S.A., J.B.C., L.A.H., C.A.G.; and manuscript editing, R.E.H., M.J.Y., J.K.B., S.A., J.B.C., E.F.C., L.L.F., C.J.D., R.A.J., M.R., A.N.A.T., C.A.G.
NIH funding: This research was supported by the National Institutes of Health [grant number 5 U01 CA080098-09].
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Sala, M. Comas, F. Macia, J. Martinez, M. Casamitjana, and X. Castells Implementation of Digital Mammography in a Population-based Breast Cancer Screening Program: Effect of Screening Round on Recall Rate and Cancer Detection Radiology, July 1, 2009; 252(1): 31 - 39. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Vinnicombe, S. M. Pinto Pereira, V. A. McCormack, S. Shiel, N. Perry, and I. M. dos Santos Silva Full-Field Digital versus Screen-Film Mammography: Comparison within the UK Breast Screening Program and Systematic Review of Published Data Radiology, May 1, 2009; 251(2): 347 - 358. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Haygood, J. Wang, E. N. Atkinson, D. Lane, T. W. Stephens, P. Patel, and G. J. Whitman Timed Efficiency of Interpretation of Digital and Film-Screen Screening Mammograms Am. J. Roentgenol., January 1, 2009; 192(1): 216 - 220. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. B. Hruska, S. W. Phillips, D. H. Whaley, D. J. Rhodes, and M. K. O'Connor Molecular Breast Imaging: Use of a Dual-Head Dedicated Gamma Camera to Detect Small Breast Tumors Am. J. Roentgenol., December 1, 2008; 191(6): 1805 - 1815. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. L. Hixson Sr, R. E. Hendrick, E. D. Pisano, M. J. Yaffe, and C. A. Gatsonis A Limitation of ACRIN DMIST Radiology, August 1, 2008; 248(2): 702 - 703. [Full Text] [PDF] |
||||
![]() |
D. B. Kopans, E. D. Pisano, S. Acharyya, R. E. Hendrick, M. J. Yaffe, E. F. Conant, L. L. Fajardo, L. W. Bassett, J. K. Baum, and C. A. Gatsonis DMIST Results: Technologic or Observer Variability? Radiology, August 1, 2008; 248(2): 703 - 704. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |