|
|
||||||||
Computer Applications |
1 From the Department of General Surgery, China Medical College and Hospital, 2 Yer-Der Rd, Taichung, Taiwan, Republic of China (D.R.C.), and the Department of Computer Science and Information Engineering, National Chung Cheng University, Chiayi, Taiwan, Republic of China (R.F.C., Y.L.H.). Received May 16, 1998; revision requested July 30; final revision received April 22, 1999; accepted June 8. Address reprint requests to D.R.C. (e-mail: dlchen88@ms13.hinet.net).
| Abstract |
|---|
|
|
|---|
MATERIALS AND METHODS: One hundred forty US images of solid breast nodules were evaluated. When a sonogram was obtained, an analog video signal from the VCR output of the scanner was transmitted to a notebook computer. A frame grabber connected to the printer port of the computer was then used to digitize the data. The suspicious tumor region on the digitized US image was manually selected. The texture information of the subimage was extracted, and a neural network classifier with autocorrelation features was used to classify the tumor as benign or malignant. In this experiment, 140 pathologically proved tumors (52 malignant and 88 benign tumors) were sampled with k-fold cross-validation (k = 10) to evaluate the performance with receiver operating characteristic curves.
RESULTS: The accuracy of neural networks for classifying malignancies was 95.0% (133 of 140 tumors), the sensitivity was 98% (51 of 52), the specificity was 93% (82 of 88), the positive predictive value was 89% (51 of 57), and the negative predictive value was 99% (82 of 83).
CONCLUSION: This system differentiated solid breast nodules with relatively high accuracy and helped inexperienced operators to avoid misdiagnoses. Because the neural network is trainable, it could be optimized if a larger set of tumor images is supplied.
Index terms: Breast neoplasms, diagnosis, 00.30 Breast neoplasms, US, 00.1298 Computers, diagnostic aid Images, analysis Images, interpretation Ultrasound (US), tissue characterization, 00.1298
| Introduction |
|---|
|
|
|---|
Although the smaller hospitals use conventional ultrasonographic (US) equipment to obtain medical images quickly, estimates of the accuracy of US diagnostic methods are controversial, and the role of US in current practice is not yet defined (2,3). One of the important reasons for this controversy is the considerable overlap of benign and malignant findings on US images; interpretation is subjective and depends on the operator.
The US examination described by Stavros et al (4) is much more extensive than the usual examinations performed at most breast imaging centers. In their study, Stavros et al (4) found that the sensitivity of breast US for malignancy was 98.4% (123 of 125 findings), the specificity was 67.8% (424 of 625), the overall accuracy was 72.9% (547 of 750), the positive predictive value was 38.0% (123 of 324), and the negative predictive value was 99.5% (424 of 426). However, these improved diagnostic results were achieved by experienced radiologists. In practice, many invasive diagnostic procedures are still required in most cases. Most of these procedures are avoidable because the rate of positive findings at biopsy for cancer is lowbetween 10% and 31% (57). To overcome the aforementioned shortcomings of US and to fully expand the capability of US, only an efficient computerized model offers objective evidence and avoids interobserver variations; therefore, an optimal and stable high diagnostic rate can be achieved by using such a model.
Both mammography (811) and US (12) can be used to detect and classify breast tumors. Although mammography can be used to visualize nonpalpable and small tumors, US is a convenient and safe tool to use in the classification of tumors, especially palpable tumors. Garra et al (13) suggest that analysis of US image texture is a simple means of markedly reducing the number of biopsies performed for benign lesions. Garra et al (13) used a co-occurrence matrix and the linear Bayesian classifier. Neural network techniques have been applied to detect microcalcifications and have even been applied to distinguish benign and malignant microcalcifications on digital mammographic images (911). Moreover, Choong et al (14) used neural networks to make prognoses for women with breast cancer.
The multilayer feed-forward neural network can be used to extract higher-order statistics by adding one or more hidden layers. This model has become extremely popular in terms of classification and prediction. The powerful error back-propagation algorithm proposed by Rumelhart el al (15) and Hirose et al (16) is the most widely used algorithm for multilayer feed-forward neural networks. We used this neural network model as a classifier to determine whether the breast tumors were benign or malignant. The diagnostic model proposed herein can exploit the nonlinear property of the learning algorithm of the neural network to classify the US images of the breast more accurately. The multilayer feed-forward neural network is a reliable choice for the proposed model because it trains well and computes efficiently.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Data Acquisition
The US image database contained 140 images of pathologically proved tumors: benign breast tumors from 88 patients and carcinomas from 52 patients (tumor size, >1 cm in all patients). The database contained only one image from each patient. The US images that depicted the largest diameter of the tumor were captured. The images were collected from January 1, 1997, to May 31, 1998, and the patients' ages ranged from 18 to 62 years. US was performed by using an Aloka SSD 1200 scanner (Tokyo, Japan) and a 7.5-MHz linear transducer with freeze-frame capability. No acoustic standoff pad was used with any of the patients. The US gain setting remained unchanged throughout the entire study, except for changes made to obtain the best view.
The following description explains how digital US images were obtained. When a sonogram was obtained, an analog video signal was transmitted from the VCR output of the scanner to a notebook computer. The data were then digitized by means of a Video CATcher frame grabber (Top Solution Technology; Taipei, Taiwan, Republic of China) that was connected to the printer port of the computer. The capturing resolutions of the portable computer and the external frame grabber was 736 x 566 pixels for a National Television Systems Committee video-screen picture. We used the ProImage software package from Prolab (version 2.0; Taipei, Taiwan, Republic of China), which was bundled with the frame grabber, to capture the real-time digital image. The monochrome US image was quantized to 8 bits (ie, 256 gray levels). The subimage of the region of interest was manually selected by using the ProImage (Prolab) package. That is, the software package was used to capture the full image from the US scanner and to select the region-of-interest subimage manually. The region-of-interest subimage was then saved as a file for later analysis with the neural network program. Figure 1 illustrates a real-time digitized monochrome US image of a malignant tumor, with 58 x 58 pixels in a 1 x 1-cm rectangle (Fig 1, A) and an image of a tumor (approximately 3.1 x 1.5 cm) that was captured with a resolution of 179 x 88 pixels (Fig 1, B).
|
|
|
between pixel (i, j) and pixel (i +
m, j +
n) on an image with size m x n can be defined as
Moreover, the two-dimensional autocorrelation coefficients were further modified to a mean-removed version to generate the similar autocorrelation features for images with different brightness but with similar texture. This modified version was expressed as
was the mean value of f(x, y). The absolute value was used in Equation (3) because a negative value may be produced when the mean is subtracted from the gray level of a pixel. We determined these two-dimensional autocorrelation coefficients for each US image of a breast tumor and used these coefficients as the interpixel features to distinguish between benign and malignant tumors.
Neural Network Classification
A multilayer feed-forward neural network contains one or more hidden layers. The function of neurons in the hidden layer is to arbitrate between the input and the output of the neural network. The input feature vector is fed into the source nodes in the input layer of the neural network. The neurons of the input layer constitute the input signals applied to the neurons in the hidden layer. The output signals of the hidden layer can be used as inputs to the next hidden or output layer. Finally, the output layer produces the output result and terminates the neural computing process.
Among the algorithms used to design the multilayer feed-forward neural networks, the back-propagation algorithm is the most popular. In general, there are two different phases in the back-propagation algorithm: the forward phase and the backward phase. In the forward phase, the input signals are computed and passed through the neural network, layer by layer. Then, the neurons in the output layer produce the output signals of the neural network. During this phase, comparing the output response of the neural network with the desired response can generate error signals.
During the backward phase of the back-propagation algorithm, some free parameters can be adjusted by referring to the error signals. This work can be used to minimize the distortion of the neural network. The multilayer feed-forward neural network has a high capability for learning and for computational efficiency. We iteratively executed the back-propagation learning algorithm for the training set and then produced the synaptic weight vectors that were applied to the neural network. We classified the benign and malignant tumors in the diagnostic model by applying the final synaptic weight vectors to the multilayer feed-forward neural network.
We used the modified version of the two-dimensional normalized autocorrelation matrix for the input of the neural network. The dimension of the matrix can be fixed for an image of any size. In this study, both
m and
n were 5, so processing a US image produced a 5 x 5 autocorrelation matrix (ie, 25 autocorrelation coefficients). The value of
(0, 0) was always 1 for a normalized autocorrelation matrix. Thus, except for the element
(0, 0), other autocorrelation coefficients were formed as a 24-dimensional image feature vector.
We used a multilayer feed-forward neural network with 25 input nodes, 10 hidden nodes, and one output node, as illustrated in Figure 3. The 24-dimensional image feature vector and a predefined threshold of the input layer were used as the input signals for the neural network. Moreover, the value produced by the output node was used to decide whether a tumor was benign or malignant. The output value of the neural network was either 0 or 1. When the output value of a US breast image was near enough to 1, the system classified the tumor on the image as malignant. Conversely, when the output value was close to 0, the system classified the tumor as benign. Figure 4 illustrates the structure of the neural network model for tissue classification. Figure 5 illustrates the program of the neural network diagnostic model. In this program, the output of the neural network was multiplied by 100.
|
|
|
| RESULTS |
|---|
|
|
|---|
m and
n, the mean difference was larger, as illustrated in Figure 6. Table 2 lists the number of training iterations and error distortions for each training set. The error distortion was defined as the absolute difference between the desired output and the actual output of the neural network. Figure 7 illustrates the ROC curve for the neural network in the classification of malignant and benign tumors. The overall performance of the neural network was evaluated by examining the ROC area index AZ over the testing output values. Our method had a high AZ value of 0.9560 ± 0.0183 (SD). Table 3 lists the performance for different threshold values. With a threshold of 0.2, the network correctly identified 51 of 52 malignant tumors and 82 of 88 benign tumors. Table 4 lists the number of tumors that were misdiagnosed in each test set by using the neural network at a threshold of 0.2. The accuracy of the neural network for detecting malignant tumors was 95.0% (133 of 140), the sensitivity was 98% (51 of 52), the specificity was 93% (82 of 88), the positive predictive value was 89% (51 of 57), and the negative predictive value was 99% (82 of 83), as illustrated in Table 5. Table 6 lists the number of tumors of various specific types in this study. From the limited data in this study, the tumor type seemed to have no relationship with the ability of the network to differentiate benign and malignant tumors.
|
|
|
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
The architecture of the neural network scheme was simple, redressed easily, and was appropriate for hardware design. Our scheme used only one type of neural network model, that is, the multilayer feed-forward neural network. Moreover, when the performance is suboptimal for new US images, these images can be added to the original training set to produce a new set of synaptic weight vectors by adjusting the free parameters. A new synaptic weight vector for the neural network can control the performance of the system. Notably, the neural network modules can redress synaptic weight vectors without modifying the other functions.
According to the experimental results, the performance of our diagnostic model in making differential diagnoses was very good. These results indicate that benign and malignant tumors can be distinguished by using interpixel correlations on digital US images. From the highly satisfactory specificity and sensitivity of the results, our system is expected to be a useful computer-aided diagnostic tool in the classification of benign and malignant tumors on sonograms. It can provide a second reading to help reduce misdiagnoses. Further studies of larger test sets of tumor images are underway.
| Footnotes |
|---|
Abbreviation: ROC = receiver operating characteristic
Author contributions: Guarantors of integrity of entire study, D.R.C., R.F.C., Y.L.H.; study concepts, D.R.C.; study design, D.R.C., R.F.C.; definition of intellectual content, D.R.C., R.F.C., Y.L.H.; literature research, D.R.C., R.F.C., Y.L.H.; clinical studies, D.R.C.; experimental studies, Y.L.H., R.F.C.; data acquisition, D.R.C., R.F.C., Y.L.H.; data analysis, Y.L.H., R.F.C.; statistical analysis, D.R.C.; manuscript preparation and review, D.R.C., R.F.C., Y.L.H.; manuscript editing, D.R.C., R.F.C.
| References |
|---|
|
|
|---|
Related Article
This article has been cited by other articles:
![]() |
B. Sahiner, H.-P. Chan, M. A. Roubidoux, L. M. Hadjiiski, M. A. Helvie, C. Paramagul, J. Bailey, A. V. Nees, and C. Blane Malignant and Benign Breast Masses on 3D US Volumetric Images: Effect of Computer-aided Diagnosis on Radiologist Accuracy Radiology, March 1, 2007; 242(3): 716 - 724. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-J. Chen, K.-S. Cheng, Y.-C. Dai, Y.-N. Sun, Y.-T. Chen, K.-Y. Chang, S.-N. Yu, T.-W. Chang, H.-M. Tsai, and C.-C. Hsien Quantitatively Characterizing the Textural Features of Sonographic Images for Breast Cancer With Histopathologic Correlation J. Ultrasound Med., May 1, 2005; 24(5): 651 - 661. [Abstract] [Full Text] [PDF] |
||||
![]() |
D.-R. Chen, R.-F. Chang, W.-M. Chen, and W.-K. Moon Computer-Aided Diagnosis for 3-Dimensional Breast Ultrasonography Arch Surg, March 1, 2003; 138(3): 296 - 302. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-M. Chen, Y.-H. Chou, K.-C. Han, G.-S. Hung, C.-M. Tiu, H.-J. Chiou, and S.-Y. Chiou Breast Lesions on Sonograms: Computer-aided Diagnosis with Nearly Setting-Independent Features and Artificial Neural Networks Radiology, February 1, 2003; 226(2): 504 - 514. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. D. Tourassi Journey toward Computer-aided Diagnosis: Role of Image Texture Analysis Radiology, November 1, 1999; 213(2): 317 - 320. [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |