Published online before print February 7, 2008, 10.1148/radiol.2471071321
(Radiology 2008;247:12.)
A more recent version of this article appeared on April 1, 2008
© RSNA, 2008
Comparing Areas under Receiver Operating Characteristic Curves: Potential Impact of the "Last" Experimentally Measured Operating Point1
David Gur, ScD,
Andriy I. Bandos, PhD, and
Howard E. Rockette, PhD
1 From the Department of Radiology (D.G.) and Department of Biostatistics, Graduate School of Public Health (A.I.B., H.E.R.), University of Pittsburgh, Imaging Research, FARP Building, 3362 Fifth Ave, Pittsburgh, PA 15213. Received July 26, 2007; revision requested September 10; revision received September 24; final version accepted October 25. Supported in part by grants EB001694, EB002106, and EB003503 (to the University of Pittsburgh) from the National Institute for Biomedical Imaging and Bioengineering, National Institutes of Health.
Address correspondence to D.G. (e-mail: gurd{at}upmc.edu).
A specific issue related to the selection of the analytic tool used when comparing the estimated performance of systems within the receiver operating characteristic (ROC) paradigm is reviewed. This issue is related to the possible effect of the last experimentally ascertained ROC data point in terms of highest true-positive and false-positive fractions. An example of a case is presented where the selection of a specific analysis approach could affect the study conclusion from being not statistically significant for parametric analysis and significant for nonparametric analysis. This is followed by recommendations that should help prevent misinterpretation of results.
© RSNA, 2008
Copyright © 2008 by the Radiological Society of North America.