|
|
||||||||
Technical Developments |
1 From the Surgical Planning Laboratory, Depts of Radiology (M.R.K., S.K.W., A.N., F.A.J., R.K.) and Neurosurgery (A.N., P.M.B.), Brigham and Womens Hospital, Harvard Medical School, 75 Francis St, Boston, MA 02115; and Lehrstuhl Technische Elektronik, University Erlangen-Nürnberg, Erlangen, Germany (M.R.K.). Received Dec 30, 1999; revision requested Feb 21, 2000; revision received and accepted May 5. Supported in part by Deutscher Akademischer Austauschdienst D/96/34017 (M.R.K.); National Multiple Sclerosis Society grant (S.K.W.); NIH grants P01 CA67165 (R.K., F.A.J.), 1R01RR11747 (R.K.), 1P41RR13218 (R.K., F.A.J., S.K.W.); and Deutsche Forschungsgemeinschaft HA 359/1-1 (A.N.). Address correspondence to R.K. (e-mail: kikinis@bwh.harvard.edu).
| ABSTRACT |
|---|
|
|
|---|
Index terms: Brain neoplasms, 10.363, 10.366 Brain neoplasms, MR, 10.121412, 10.12143 Magnetic resonance (MR), technology Magnetic resonance (MR), three-dimensional, 10.121412, 10.12143 Magnetic resonance (MR), volume measurement, 10.121412, 10.12143 Technology assessment
| INTRODUCTION |
|---|
|
|
|---|
Although 2D images accurately depict the size and location of anatomic objects, the process of generating 3D views to visualize structural information and spatial anatomic relationships is a difficult task, which is usually carried out in the clinicians mind. Image-processing tools provide the surgeon with interactively displayed 3D visual information that is somewhat similar to the view of the surgeon during surgery; the use of these tools facilitates comprehension of the entire anatomy. For example, the (mental) 3D visualization of structures that do not readily align with the planes of the images (eg, the vascular tree) is difficult if it is based on 2D images alone.
Image-based modeling requires the use of computerized image-processing methods, which include segmentation, registration, and display. Segmentation with statistical classification techniques (7,8) has been successfully applied to gross tissue type identification. Because the acquisition of tissue parameters is insufficient for successful segmentation due to the lack of contrast between normal and pathologic tissue (9,10), statistical classification may not allow differentiation between nonenhancing tumor and normal tissue (1113). Explicit anatomic information derived from a digital atlas has been used to identify normal anatomic structures (1416).
We developed an automated segmentation tool that can be used to identify the skin surface, ventricles, brain, and tumor in patients with brain neoplasms (17,18). The purpose of the current study was to compare the accuracy and reproducibility of this automated method with those of manual segmentation carried out by trained personnel.
| Materials and Methods |
|---|
|
|
|---|
Brain Tumor Patients
Twenty patients were selected from a neurosurgical database of images in approximately 260 patients with brain tumors. Cases of the 260 patients had been postprocessed for image-guided neurosurgery by using a combination of semiautomated techniques and manual outlining of the skin surface, brain, ventricles, vessels, and tumor.
Two neurosurgeons (including A.N.) were asked to select 20 cases with meningiomas and low-grade gliomas of different sizes, shapes, and locations to provide a representative selection. These two types were selected because they are relatively homogeneous and have well-defined imaging characteristics. Pathologic diagnoses included six meningiomas (cases 13, 11, 12, 16), and 14 low-grade gliomas (cases 410, 1315, 1720). In this study, six of six meningiomas were well enhancing, and 14 of 14 low-grade gliomas were nonenhancing.
Cases 110 formed the development database used for the design and validation of the automated segmentation method. To ensure that the method produced correct results when applied to cases other than those of the development database, validation was carried out separately with the validation data sets from cases 1120 in addition to validation with the 10 development cases.
Automated Segmentation of Brain and Tumor
General segmentation framework.We adopted a general algorithm called adaptive templatemoderated classification (see references 17 and 18 and the Appendix for details). The technique involves the iteration of statistical classification to assign labels to tissue types and nonlinear registration to align (register) a digital anatomic atlas (presegmented anatomic map) to the patient data (Fig 1). Statistical classification was used to divide an image into different tissue classes on the basis of the signal intensity value. If different tissue classes have the same or overlapping grey-value distributions (eg, cerebrospinal fluid and fluid within the eyeballs), such methods fail. Therefore, additional information about the spatial location of anatomic structures was derived from a registered anatomic atlas (manually segmented MR image of a single subject) (6). Objects of interest were identified on the classified images with local segmentation operations (mathematic morphology and region growing) (19).
|
|
At this point, all voxels belonging to the brain, ventricles, and tumor were labeled as ICC. In the first iteration, the ICC was segmented by using the head and ICC template from the initially registered atlas. The atlas was then realigned on the basis of the whole head and ICC of the patient. This step was followed by a second classification and local segmentation step. The ventricles were segmented from the ICC in a third segmentation iteration. At this point, the ICC contained only voxels belonging to the brain and tumor.
Having defined a region of interest for the tumor, which was located inside the brain and outside the ventricles and skin (fat and bone), the tumor was segmented in two iteration cycles. In the first iteration, the tumor was classified by using the anatomic knowledge from only the atlas; this step was followed by application of the local segmentation strategy. Because there was no tumor template in the atlas, a straightforward registration was not possible. Consequently, tumor voxels were relabeled as ICC voxels prior to the registration process. As a result, a spatial correspondence between the atlas and patient data set was established for every voxel, since the patient data set contained no voxels labeled as tumor at the time registration of the atlas was carried out.
In the second iteration, tumor segmentation from the first iteration was used as an anatomic template. Although this template was approximate, the additional information about the location of the tumor prevented misclassification of voxels distant to the atlas template as tumor.
Initialization of the automated segmentation method.To reduce noise on the MR image without blurring object edges, an anisotropic diffusionfiltering method was applied (20). For the initialization of the automated segmentation method, a graphical user interface was developed for the 2D display of MR imaging sections and the selection of example tissue points with use of a mouse (Fig 3). The only interaction required by the operator (see Validation Experiments) was the selection of three to four example points for each tissue class, that is, skin (fat and bone), brain, ventricles, and tumor. The program calculated a statistical model for the distribution of the gray values on the basis of these manually selected tissue prototypes.
|
Validation Experiments
Because of the lack of an acceptable standard (eg, realistic phantom) for comparison, our definition of a segmentation standard was based on the manual segmentations with interactive computer segmentation tools. However, manual segmentation is subject to interobserver variability and human error (6). To minimize the influence of these factors while maintaining a means of measuring the segmentation accuracy of the individual raters, the standard was defined on the basis of the segmentations of four independent human observers. A single 2D section was randomly selected from the subset of the MR imaging volume that showed the tumor. The four human observers then independently outlined the brain and tumor on this section by hand. The standard segmentation of brain and tumor in each patient data set was defined as the area of those voxels in which at least three of four raters agreed regarding their identification. All other voxels were labeled as background.
To assess accuracy, the automated segmentation tool was trained once with a single MR imaging section containing all tissue types of interest and was executed on the full 3D data set. This process resulted in segmentation of the entire data set. For each data set, the structures skin (fat and bone), brain, ventricles, and tumor were segmented.
The interrater variability of the four independent manual and the four independent automated segmentations was measured on the basis of all 20 cases. For the measurement of intraobserver variability, one of the medical experts also manually segmented the selected 2D section four times during 1 week in each of the 20 cases. Training of the automated method was also carried out four times during 1 week in all 20 cases.
During all experiments, the times for manual outlining, training, and computation for the automated segmentation method were recorded.
Statistical Analysis
Qualitative analysis was carried out on the basis of volume-of-overlap comparison with standard (accuracy) and overall volume variability (reproducibility) in the 2D section selected. Segmentation accuracy was defined as the percentage of correctly classified voxels (in object and background) with respect to the total number of voxels V on the image, that is, (TP + TN)/V, where TP is the number of true-positive voxels and TN is the number of true-negative voxels (21). The mean and SDs of the accuracy values with respect to the 20 test cases were also calculated (Matlab version 4.1; Mathworks, Cambridge, Mass).
To assess the inter- and intrarater variability error, the coefficient of variation CV% was calculated as follows: CV% = 100(SDvolume/Meanvolume). The coefficient of variation does not measure the correctness of segmentation, only the change in the volume of objects in segmentations of different raters.
| Results |
|---|
|
|
|---|
|
|
|
|
|
Automated segmentation of a complete 3D image volume required approximately 75 minutes of unsupervised computation time (Sun ES 6000 server, 20 central processing units with 250-MHz speed and 5 Gbyte of random-access memory; Sun Microsystems). The overall operator time for training of the automated method was approximately 510 minutes (selection of example voxels for each of the relevant tissue classes). Manual outlining of brain and tumor required 13 minutes per section. Time for manual segmentation of the 3D volume was on the order of 35 hours.
| Discussion |
|---|
|
|
|---|
The goals of the development of automated segmentation tools are to make segmentation of MR images more practical by replacing manual outlining, which reduces operator time, without a measurable effect on the results and to improve reproducibility. However, the validity of our segmentations is difficult to assess without the availability of a standard. Therefore, our validation study was designed to determine how closely the raters agreed within a single method (automated and manual) and how closely the segmentation results correlated between the two methods.
Segmentation accuracy with the automated method was high and within (maximum difference, 0.6%) the accuracy range of the manual method. The errors with automated brain segmentation were in part due to over- and undersegmentation in the area of the tentorium and the lateral sulcus with abundant vessels. The algorithm tended to cause oversegmentation in these areas if parts of the neck near the cerebellum were misclassified as brain and if the ICC template derived from the atlas was misaligned.
The size of the structure affects segmentation accuracy. Segmentation errors occur on the boundary of surfaces. Thus, the larger the surface of an object, the more voxels on the entire image that can potentially be misclassified. Therefore, accuracy is lower with larger objects than with smaller objects.
Reproducibility was higher with the automated method because only the selection of a few example points is required, not decision making for every voxel on the image during manual segmentation. The reproducibility of brain and tumor segmentation was high. Nevertheless, the inter- and intraobserver reproducibility of both methods was higher with the brain than with the tumor. Larger objects tend to have a volumetric reproducibility that is higher than the overall segmentation accuracy. Because the surface-to-volume ratio behaves approximately like 1/r (where r is the object radius), the disagreement about voxel classes on the surface of larger objects with respect to the overall volume is less consequential than it is with smaller objects.
Interobserver variability was substantially reduced with the automated method. Manual interobserver variability was particularly high for low-grade gliomas, which were more difficult to segment, causing deviating expert opinions. Automated segmentation is more robust to expert variation because it involves only the selection of typical example points for training the algorithm, while manual segmentation requires a human decision for every boundary voxel, which is difficult due to, for example, partial voluming. However, intraobserver variability was improved only with meningioma segmentation. For low-grade gliomas, manual intraobserver variability is substantially lower than interobserver variability because the execution of manual segmentation varies, but the opinion regarding the shape of the tumor does not. Therefore, compared with manual segmentation, automated segmentation does not reduce interobserver variability substantially.
Reproducibility was higher with meningiomas with both methods. This findings can be explained by comparing the gray-value distributions of the meningiomas or low-grade gliomas with that of the brain. The meningioma tissue class partially overlaps parts of the skin, the fat in the neck, and the straight and superior sagittal sinuses, and it was well distinguishable from brain tissue with the application of a contrast agent. When the region of interest was restricted to the ICC, the tissue that showed signal intensity overlap with the meningioma was excluded, and the meningioma was successfully segmented.
In some cases of low-grade glioma, the ICC may not have been a sufficient ROI for accurate tumor segmentation due to the similar signal intensities of the tumor and surrounding gray matter. False classifications cannot be corrected if the brain misclassified as tumor tissue is adjacent to the tumor boundary (oversegmentation) or vice versa (undersegmentation). The incorporation of T2-weighted images, which clearly distinguish the tumor as hyperintense tissue, may enable the precise definition of the tumor boundaries. If the voxels of the brain misclassified as tumor are distant to the tumor boundary, if they are connected to the tumor by only thin structures, or if tumor voxels inside the tumor are falsely classified as brain, false classifications can be corrected.
The algorithm developed in this work is based on template-driven segmentation in which an anatomic atlas is used to guide a statistical classification process (8,14,17,18,23). Clark et al (27) proposed a method for automatic detection and segmentation of glioblastoma multiforme on a combination of T1-, T2-, and intermediate-weighted MR images with use of classification and an anatomic knowledge database; accuracy was greater than 90%. Bonnie et al (24) recently reported results with use of an interactive tumor segmentation method. However, its value is difficult to assess because no detail on the segmentation technique is given. Approaches based on MR imaging data alone with use of active contours (25) or multispectral classification (12,13) work well if the tumor shows sufficient contrast to the brain. However, active contours require good initialization, which is difficult to automate, while multispectral classification reveals problems with overlapping intensity distributions.
The lack of automated segmentation methods results in tedious manual labor. This result has been one of the reasons why 3D models have been typically limited to university research settings. The reduction in operator time (35 hours to 510 minutes) makes it practical to consider the integration of computerized segmentation into daily clinical practice for presurgical 3D planning and intraoperative navigation in routine neurosurgical procedures. A technician carries out the initial work, and a radiologist verifies the result is verified while softreading the images. Our software is currently used on a powerful computer; systems such as ours are becoming increasingly affordable (26).
In conclusion, accurate segmentation is possible for meningiomas and low-grade gliomas with our automated method. Further work is required to extend the tools to a broader range of brain tumors (eg, glioblastoma multiforme). Future clinical studies on the accuracy and reproducibility of our technique in a larger population will be necessary to determine its practical use in a clinical setting.
| APPENDIX |
|---|
|
|
|---|
= 5.2; kNN classification k, 5; number of classes C, five; affine registration, nine degrees of freedom; image resolution levels, three; distance transform saturation distance, 100; nonlinear registration, three resolution levels; window size w, 9 x 9 x 9; morphologic operators, spherical element; size, 7 x 7 x 7; region growing, connectivity of 18. Four classification-registration iterations were used for ICC segmentation, one iteration was used for ventricle segmentation, and two iterations were used for tumor segmentation. The brain and ventricle are also resegmented during tumor segmentation.
For segmentation of normal structures (ie, skin, fat, and bone; brain; ventricles), the pattern used in this work was vi = [v1i, ..., v5i]T, where i is the index to voxel location xi. The elements vji result from image processing operations Tj as follows: v1i = T1[I(xi)], where T1 is anisotropic diffusion filtering ; v2i = T2[A(xi)], where T2 is the distance transform of skin, fat, and bone; v3i = T3[A(xi)], where T3 is the distance transform of the background of skin, fat, and bone; v4i = T4[A(xi)], where T4 is the distance transform of brain; and v5i = T5[A(xi)], where T5 is the distance transform of the background of the brain. The elements are applied to the MR image I(xi) or the image of the registered anatomic atlas A(xi). While T1 is carried out only during the preprocessing stage, the operators T2 to T5 are applied to the reregistered atlas in every segmentation iteration cycle.
For the first tumor segmentation cycle, the patterns are also vi = [v1i, ..., v5i]T. For the second tumor segmentation cycle, the patterns are vi = [v1i, ..., v6i]T, where v1i for i = 15 are defined as above but with the additional pattern v6i = T6[A(xi)], T6: distance transform of initial tumor segmentation where Ib is the resultant image of the first tumor segmentation.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Author contributions: Guarantors of integrity of entire study, all authors; study concepts, all authors; study design, M.R.K., A.N., S.K.W., R.K.; definition of intellectual content, all authors; literature research, M.R.K.; clinical studies, A.N., P.M.B.; data acquisition, M.R.K., A.N.; data analysis, M.R.K.; statistical analysis, M.R.K., S.K.W.; manuscript preparation, M.R.K.; manuscript editing, M.R.K.; manuscript review, all authors.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
B Cornelissen, V Kersemans, L Jans, L Staelens, R Oltenfreiter, T Thonissen, E Achten, and G Slegers Comparison between 1 T MRI and non-MRI based volumetry in inoculated tumours in mice Br. J. Radiol., April 1, 2005; 78(928): 338 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. F. H. Chong, J.-Y. Zhou, J. B. K. Khoo, J. Huang, and T.-K. Lim Nasopharyngeal Carcinoma Tumor Volume Measurement1 Radiology, June 1, 2004; 231(3): 914 - 921. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. S. Sonnad Describing Data: Statistical and Graphical Methods Radiology, December 1, 2002; 225(3): 622 - 628. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |