|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Thoracic Imaging |
1 From the Departments of Radiology (B.Z., L.H.S., L.W., M.S.G., C.A.C., L.J., J.P.K.) and Epidemiology and Biostatistics (C.S.M.), Memorial Sloan-Kettering Cancer Center, 1275 York Ave, New York, NY 10021. Received January 7, 2004; revision requested March 9; revision received May 3; accepted June 2. Supported in part by grants from William H. Goodwin and Alice Goodwin and the Commonwealth Foundation for Cancer Research and The Experimental Therapeutics Center of Memorial Sloan-Kettering Cancer Center. Address correspondence to B.Z. (e-mail: zhaob@mskcc.org).
| ABSTRACT |
|---|
|
|
|---|
MATERIALS AND METHODS: Analysis and data collection were approved by the Institutional Review Board, with waived informed patient consent. Forty-two pulmonary metastases in 10 patients (three men and seven women; age range, 4383 years; mean age, 65.4 years) were analyzed on CT scans obtained with 3.75-, 5.0-, and 7.5-mm section thicknesses. The lesions were automatically delineated by using a three-dimensional multicriteria segmentation algorithm. Unidimensional (the largest diameter), bidimensional (the product of the two maximal perpendicular diameters), maximal cross-sectional area, and volumetric measurements were automatically obtained for each pulmonary lesion on each section thickness. Means and variances were calculated, and the differences across the three section thicknesses for each of the four measurements were studied by using linear mixed-effects models. The Levene test was used to study the equality of variances.
RESULTS: Differences in the means for unidimensional, bidimensional, and area measurements were significant between a section thickness of 3.75 and 5.0 mm (unidimensional, P = .05; bidimensional, P = .05; area, P = .01) and 3.75 and 7.5 mm (unidimensional, P = .06; bidimensional, P = .03; area, P = .02), but not 5.0 and 7.5 mm. There was a significant difference in volumetric measurement as section thickness decreased from 7.5 to 5.0 mm (P <.001) and from 7.5 to 3.75 mm (P < .001). Although there was a slight trend for differences in the variances across section thickness for each measurement, none of the differences were significant.
CONCLUSION: Volumetric tumor measurements change with a reduction in section thickness from 7.5 to 5.0 and 3.75 mm. For unidimensional measurement, no change was found when thickness decreased from 7.5 to 5.0 mm.
© RSNA, 2005
| INTRODUCTION |
|---|
|
|
|---|
The evolution of medical imaging technology is toward acquisition of images with faster speed and at higher resolution, with the goals of reducing image artifacts, improvingdetection of small lesions, and assessing subtle changes of disease response or progression. Furthermore, the digital environment of hospitals, including picture archiving and communication system, allows not only superior manual measurements on workstations but also the integration of computerized measurements into the system. Thin-section multidetector row CT and volumetric magnetic resonance (MR) imagers generate large quantities of data, and benefits may be realized in the detection of smaller lesions and in the ability to reconstruct images in multiple planes. A practical issue has been raised as to whether thin-section images are always needed. In other words, what would be the appropriate image resolution for a certain application, such as therapy response assessment, where rules governing response and progression based on change in lesion size have already been established?
In the current clinical practice and in clinical trials, the assessment of the effectiveness of a drug relies on measuring tumor growth or regression by using unidimensional or bidimensional measurements. The historical use of unidimensional and bidimensional measurements is based on the fact that these measurements can be easily performed by a radiologist with handheld calipers for hard-copy images or with electronic calipers at display monitors. Rules governing the regression or growth of a tumor with unidimensional and bidimensional measurements have been established and are routinely used, and suggestions for CT section thickness relative to lesion size have been proposed but not practically tested (1,2). In addition, some groups have advocated the use of volumetric assessment of a tumor (8).
While there is debate regarding the appropriate methods of assessing tumor response, little attention has been paid to the methods of data acquisition. In general, routine site-defined CT scanning protocols are used for phase II and III oncology trials. Section thickness considerations are limited to narrow ranges of possible acquisitions, since frequent serial scanning is generally performed on large (chest, abdomen, and pelvis) areas. Thus, the purpose of our study was to assess the effect of commonly used CT section thicknesses on metastatic tumor measurements calculated with the following four methods: unidimensional, bidimensional, area, and volumetric.
| MATERIALS AND METHODS |
|---|
|
|
|---|
CT scans obtained from 10 patients (three men, seven women; average age, 65.4 years; range, 4383 years) with documented pulmonary metastases (at biopsy or interval growth at sequential scanning) were randomly selected from daily clinical practice. The primary tumors were lung (n = 2), melanoma (n = 1), sarcoma (n = 1), renal (n = 2), colorectal (n = 3), and breast (n = 1). The acquisition protocol for this assessment was a beam pitch of 0.75, 120 kVp, 190290 mAs, pitch of 1.35, and 7.5-mm collimation with 7.5-mm reconstruction interval. In addition to the 7.5-mm reconstruction interval, raw data were reconstructed with additional two narrower intervals, 5.0 and 3.75 mm. Nodules larger than 5 mm were chosen to simulate the actual requirements of the Response Evaluation Criteria in Solid Tumors and of the World Health Organization, as well as the routine daily practice of oncologic radiologists. Forty-two such pulmonary metastases were identified in the 10 patients and were included in this evaluation.
Automated Segmentation and Quantification
An automated three-dimensional multicriteria algorithm was developed previously, making use of the differences in opacity and shape to separate pulmonary lesions from their surrounding tissues on CT images (911). By using this algorithm, a rectangular region of interest, which tightly enclosed the lesion under study, was outlined manually by an operator (L.W. or B.Z.) on one CT section and then propagated automatically to all other sections. To reduce the partial volume effect of CT scanners and to simplify definitions of some three-dimensional operations, the lesion volume of interest was supersampled by using a trilinear interpolation to an isotropic scale in the x-, y-, and z-axes. An initial high attenuation value was determined on the basis of the opacity distribution of the voxels in the lesion volume of interest. The threshold was decreased stepwise until the one subsequent threshold resulted in substantial changes to the gradient strength for the lesion surface (average opacity changes on the lesion surface). If the lesion identified at this threshold level possessed a compact shape, the segmentation terminated. Otherwise, a series of morphologic operations would be applied, starting with a small filter size. As soon as the lesion showed a compact shape, the erosion ceased. Use of the shape criterion introduced to the algorithm could efficiently detach the surrounding high-opacity structures such as blood vessels from pulmonary lesions. The algorithm had been validated with small lung nodule phantoms and kidney stone phantoms on high-spatial-resolution helical CT images and achieved accurate volumetric measurements (12,13).
Once a lesion was separated from its surrounding tissues, its maximal diameter, two perpendicular maximal diameters, maximal area, and volume could be automatically calculated. Explicitly, the area of a lesion on a CT section was the number of lesion pixels on that section multiplied by two in-plane resolutions in x- and y-axes. The area measurement was defined as the largest in-plane area among the lesion areas on all sections containing the lesion. Lesion volume was the sum of lesion areas on each of the lesion sections multiplied by the scan transverse resolution. To obtain uni- and bidimensional diameters of a lesion, the contour of the cross-sectional lesion having the maximal area was extracted. The longest distance of any two pixels on the contour was automatically taken as the largest diameter of the lesion. Along the line of the maximal diameter, perpendicular diameters were then determined, and the longest diameter multiplied by the in-plane resolution (in x- or y-axis) was defined as the largest perpendicular diameter.
The Figure is an example of the automated segmentation and quantification of a lung nodule at multidetector row CT. A 31.9 x 24.2-mm nodule was surrounded by blood vessels in the right lung (within the white outline in part a). Part c displays an intermediate segmentation result, which was obtained by using only the gradient strength criterion, where some of the contiguous vessels were falsely segmented as part of the nodule. After the shape criterion was applied, the vessels were successfully removed from the nodule, whereas the details of the nodule surface were retained (Figure, part d). Part b shows the automatically obtained nodule contours overlapped on the nodule images, the maximal area section, and the two perpendicular diameters.
|
|
|
|
Statistical Analysis
To determine whether section thickness affects the different measurements (unidimensional, bidimensional, area, and volumetric), the means and variances of the different section thicknesses were compared for each of the four measurements.
Linear mixed-effects models (14) were used to compare the differences in the means of the three section thicknesses, while accounting for the correlated nature of the data due to multiple nodules and multiple scans for each patient. These comparisons were made by using the natural logarithm scale to satisfy the assumptions of linear mixed-effects model methods. Multiple comparisons were adjusted by using the method of Tukey (15). To compare variances, permutation tests with a modification of the Levene test for equality of variances proposed by Brown and Forsythe were used (16,17), where we carefully permuted the data to reflect the correlation. For all analyses, we considered P
.05 to indicate a significant difference. The linear mixed-effects model analysis was performed with SAS software (version 8.00; SAS Institute, Cary, NC), and variance comparisons were performed with S-Plus 2000 software (MathSoft, Seattle, Wash).
| RESULTS |
|---|
|
|
|---|
|
|
In contrast, the results of the volumetric measurement do not follow the pattern of the other three measurements. Section thicknesses of 3.75 and 5.0 mm do not produce significantly different volumetric measurements, but a section thickness of 7.5 mm does produce significantly different measurements than do the other two section thicknesses (P < .001 for both comparisons, Table 2).
In an attempt to determine whether resolution affects the precision of the measurements, comparison of the variances of the three section thicknesses with each measurement was made (Table 3). For the unidimensional, bidimensional, and area measurements, the variance in tumor measurements is similar across the section thickness. For the volumetric measurement, the data suggest that a 7.5-mm section thickness may result in higher variability of the measurements, but the difference in variation is not significant (P = .60).
|
| DISCUSSION |
|---|
|
|
|---|
Study findings have shown substantial variability in therapeutic response assessment. Both intraobserver and interobserver variations in tumor measurement exist for a variety of reasons, such as size, irregular shape, and poorly defined margins of a lesion; the phase of intravenous contrast agent administration and the exact levels through which a lesion is scanned; and the measurement technique used (eg, handheld calipers, electronic calipers, or automated techniques) (1823).
Frequently in clinical trials, patients may be scanned with a variety of scanners and a variety of image acquisition protocols. For example, it is not uncommon for a patients baseline study to be performed at one institution and, when the patient enrolls in a protocol, to have the follow-up scans obtained at another institution, which may use different equipment and imaging protocols. The effect of this change needs to be assessed and should be evaluated with reference to the type of measurement obtained. Our results suggest that, especially with unidimensional and bidimensional tumor measurements, there is no difference in varying section thicknesses between 7.5 and 5.0 mm. It is not surprising that when volumetric measurements are used, the differences in volume between various section thickness are great. A recently published article (24) reported that volumetric overestimation of simulated lung tumors on spiral CT images ranged from 15% (38.1 mm in diameter, 2-mm section thickness) to 278% (12.7 mm in diameter, 10-mm section thickness) when a perimeter method (ie, a radiologist manually drew the nodule contour on each section containing the nodule, and the area enclosed by the perimeter was calculated by using computer software) was used. For the first time, to our knowledge, the authors, using the phantom data, derived error-reduction equations to compensate for the overestimation caused by the different section thicknesses and varying nodule sizes. Findings of their preliminary study showed promise of such compensatory equations in reducing the overestimation with clinical subjects (24).
Other factors may have a great effect on lesion size measurements. For instance, the evaluation of focal hepatic metastases will vary greatly depending on the use of a contrast agent and the phase of contrast agent administration. Other soft-tissue mass measurements are affected by lesion border delineation. Lesions with relatively little difference in contrast between the lesion and the surrounding parenchyma may be different to define and measure. It is for this reason that we choose to assess the effect of section thickness on pulmonary nodules, which have relatively well-defined borders.
In our experimental study, we attempted to reduce the measurement errors caused by the partial volume effect of a CT scanner and by the systematic sampling by using a super-resampling technique. Nevertheless, theoretic analysis of these errors is beyond the scope of this study and can be found in other published work (25,26).
Fully automatic computerized methods, including the unidimensional, bidimensional, area, and volumetric measurements, have been implemented and applied to the size estimation of lung nodules on CT images reconstructed at different section thicknesses. One of the advantages of the segmentation algorithm used in this work is its automatic, objective, and consistent way to determine the optimal thresholds to separate nodules from the surrounding tissues. Although the algorithm requires a manual initiation by placing a region of interest that tightly encloses the nodule to be segmented, it can maximally minimize, if not totally eliminate, variations in the segmentation results caused by different selections of the nodule region of interest. To maximally reduce false segmentation results caused by the surrounding tissues of similar opacities, we tried to (a) choose isolated nodules whenever possible, that is, nodules that were not attached to any surrounding tissues, or (b) maximally exclude the contiguous surrounding tissues of similar opacities, such as blood vessels, when drawing the rectangular region of interest. Because of the nature of the algorithmthat is, its use of the average gradient calculated from the nodule surface to determine an optimal thresholdit favors sharper CT images (eg, images reconstructed with the lung convolution kernel), particularly when nodules are small. However, not much attention was paid to the choice of CT reconstruction algorithm since (a) the same imaging protocol was used throughout the study and (b) the nodule sizes of interest were relatively large. How the CT reconstruction algorithm affects the segmentation algorithm is beyond the scope of this study.
Because the in-plane CT resolution is relatively high (0.60.8 mm) compared with the size of the nodule of interest (>5 mm), inaccuracy in the measurement is primarily caused by the lower resolution along the z-axis. Because of partial volume averaging, the largest diameter, perpendicular largest diameters, area, and volume of a nodule may have different attenuation values when measured on an image with a thinner section thickness than when measured on an image with a thicker section thickness. A major limitation of our study was the lack of a standard for the size of these in vivo nodules. However, it was not necessarily the purpose of our study to prove the accuracy of the techniques but to evaluate the differences in the results obtained in terms of both absolute attenuation values and variances.
Our data indicate that the results obtained depend on the type of measurement being performed. We found that measurements performed in a single plane are less "sensitive" to section thickness; thus, there is little advantage in decreasing section thickness from 7.5 to 5.0 mm. Volumetric measurements, however, demonstrate a greater change between 7.5- and 5.0-mm and 7.5- and 3.75-mm section thickness. These data should affect the design and implementation of the imaging portion of clinical trials. In this article, we have reported results of our initial experience. Further analysis with a large sample size and different types of lesions and imaging modalities is required to confirm our results and generalize these guidelines. These data must be interpreted carefully with regard to the actual nodule size selected and the section thickness used. If thinner sections were used or small or larger nodules were included in the sample data set, then the results would be different.
There were a number of limitations to our study. First, we included only pulmonary nodules of a limited size range. The purpose of our study was to assess CT section thickness in therapeutic response assessment, and we chose nodule sizes that are acceptable according to the rules governing Response Evaluation Criteria in Solid Tumors and the World Health Organization, the two most commonly used trial methods. Furthermore, we did not assess other tumor sites because heterogeneity in liver lesions and the complex size and/or shape of lymph node metastases will affect measurements in other ways not related to section thickness alone. We could have chosen to evaluate other section thicknesses; however, those evaluated in our study are the most commonly used worldwide in clinical trials and are also suggested in response criteria guidelines.
In conclusion, CT section thickness reconstruction will affect the measurement of tumors. The measurement of tumors is affected by the magnitude of change in the CT section thickness reconstruction, as well as by the methods of tumor measurement. Our data indicate that the largest effect of change is between 3.75 and 7.5 mm and in volumetric measurement. The smallest effect is between 7.5 and 5.0 mm and in unidimensional measurement.
| FOOTNOTES |
|---|
Author contributions: Guarantors of integrity of entire study, L.H.S., B.Z.; study concepts, L.H.S., B.Z., M.S.G.; study design, L.H.S., B.Z., C.S.M.; literature research, B.Z., L.H.S.; clinical studies, L.W., L.J., B.Z., L.H.S., M.S.G.; data acquisition, J.P.K., C.A.C., L.J.; data analysis/interpretation, C.S.M., L.H.S., B.Z.; statistical analysis, C.S.M., L.H.S., B.Z.; manuscript preparation, editing, and final version approval, B.Z., L.H.S., C.S.M.; manuscript definition of intellectual content and revision/review, B.Z., L.H.S.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
B. Zhao, L. P. James, C. S. Moskowitz, P. Guo, M. S. Ginsberg, R. A. Lefkowitz, Y. Qin, G. J. Riely, M. G. Kris, and L. H. Schwartz Evaluating Variability in Tumor Measurements from Same-day Repeat CT Scans of Patients with Non-Small Cell Lung Cancer Radiology, July 1, 2009; 252(1): 263 - 272. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Nietert, J. G. Ravenel, W. M. Leue, J. V. Miller, K. K. Taylor, E. S. Garrett-Mayer, and G. A. Silvestri Imprecision in Automated Volume Measurements of Pulmonary Nodules and Its Effect on the Level of Uncertainty in Volume Doubling Time Estimation Chest, June 1, 2009; 135(6): 1580 - 1587. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Gavrielides, L. M. Kinnard, K. J. Myers, and N. Petrick Noncalcified Lung Nodules: Volumetric Assessment with Thoracic CT Radiology, April 1, 2009; 251(1): 26 - 37. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Ravenel, W. M. Leue, P. J. Nietert, J. V. Miller, K. K. Taylor, and G. A. Silvestri Pulmonary Nodule Volume: Effects of Reconstruction Parameters on Automated Measurements--A Phantom Study Radiology, May 1, 2008; 247(2): 400 - 408. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Petrou, L. E. Quint, B. Nan, and L. H. Baker Pulmonary Nodule Volumetric Measurement Variability as a Function of CT Slice Thickness and Nodule Morphology Am. J. Roentgenol., February 1, 2007; 188(2): 306 - 312. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Zhao, L. H. Schwartz, C. S. Moskowitz, M. S. Ginsberg, N. A. Rizvi, and M. G. Kris Lung Cancer: Computerized Quantification of Tumor Response--Initial Results Radiology, December 1, 2006; 241(3): 892 - 898. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. C. Jaffe Measures of Response: RECIST, WHO, and New Alternatives J. Clin. Oncol., July 10, 2006; 24(20): 3245 - 3251. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |