|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Thoracic Imaging |
1 From the Departments of Radiology (H.A.G., W.P.T.M.M., M.P.) and Medical Oncology (G.G.), University Medical Center, Heidelberglaan 100, 3584 CX Utrecht, the Netherlands; and Department of Radiology, Academic Medical Center, Amsterdam, the Netherlands (C.M.S.). From the 2005 RSNA Annual Meeting. Received June 18, 2006; revision requested August 23; revision received October 30; accepted November 22; final version accepted April 3, 2007. Address correspondence to H.A.G. (e-mail: h.gietema{at}umcutrecht.nl).
| ABSTRACT |
|---|
|
|
|---|
Materials and Methods: This study had institutional review board approval; written informed consent was obtained from all patients. Between June 2004 and March 2005, 20 patients (15 men, five women; age range, 40–84 years; mean age, 57 years) referred for chest CT for known lung metastases underwent two additional low-dose chest CT examinations without contrast material (collimation, 16 x 0.75 mm). Between these examinations, patients got off and on the table to simulate the conditions for a follow-up examination. Noncalcified solid pulmonary nodules between 15 and 500 mm3 that did not abut vessel or pleura were measured in both studies by using widely applied commercial semiautomated software. Interscan variability was established with the Bland and Altman approach. The impact of nodule shape (spherical or nonspherical) on measurement variability was assessed by using one-way analysis of variance, while the contributions of mean nodule volume and change in lung volume were investigated with univariate linear regression for completely (group A) and incompletely (group B) segmented nodules.
Results: Two hundred eighteen eligible nodules (volume range, 16.4–472.7 mm3; 106 spherical, 112 nonspherical) were evaluated. The 95% confidence interval for difference in measured volumes was –21.2%, 23.8% (mean difference, 1.3%). The precision of nodule segmentation was highly dependent on nodule shape (P < .001) and was weakly related to inspiration level for completely segmented nodules (r = –0.20; P < .047), while mean nodule volume did not show any effect (P = .15 and P = .81 for group A and B nodules, respectively).
Conclusion: Variation of semiautomated volume measurements of pulmonary nodules can be substantial. Segmentation represents the most important factor contributing to measurement variability, while change in inspiration level has only a weak effect for completely segmented nodules.
© RSNA, 2007
| INTRODUCTION |
|---|
|
|
|---|
Volume doubling times of malignant pulmonary nodules may vary between 30 and 400 days (6). For small or moderately growing nodules, the increase in size over a follow-up period of up to 1 year will be only in the range of a few voxels (2,6). Therefore, it is crucial to distinguish between real but slow changes in size and other factors that influence volume measurements. Patient position, heart pulsation, and inspiration levels have been hypothesized to influence the assessment of nodule size (3). Interscan variability has already been shown to be an important factor by Wormanns et al (8), but only patient positioning was considered in that study. Kostis et al (3) evaluated the reproducibility of repeat volume measurements of stable small pulmonary nodules that showed no increase or decrease in size for more than 2 years. As a consequence of the inclusion criteria, the vast majority of nodules were smaller than 5 mm in diameter, and no distinction could be made between truly identical volumes over the follow-up period and minor changes in size during this period.
The purpose of our study was to prospectively assess the precision of semiautomated volume measurements of pulmonary nodules at low-dose multi–detector row computed tomography (CT) and to investigate the influence of nodule size, segmentation algorithm, and inspiration level.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Even though the patients also had larger lesions, we included only nodules smaller than 500 mm3, a size that corresponds to a mean diameter of approximately 10 mm (exact value, 9.85 mm), because the commercially available algorithm we applied (LungCare; Siemens Medical Solutions, Erlangen, Germany) has been released for use with nodules smaller than 10 mm in diameter. The minimum nodule volume we included in this evaluation was 15 mm3 (corresponding to a diameter of about 3 mm). We included not only nodules suspected of being metastases but also nodules that could potentially have benign histologic features. Completely calcified nodules, however, were excluded. We included only solid nodules in our evaluation because the current software is not released for use in volume measurements of nonsolid and part-solid nodules.
Image Acquisition
We performed two low-dose chest CT examinations without contrast material, followed by a contrast material–enhanced standard-dose chest CT examination for clinical purposes. Between the two low-dose examinations, patients were asked to get off and on the table to simulate the conditions of a repeat examination for follow-up of a pulmonary nodule. With that setup, growth or decrease in size of the lung lesions could reliably be excluded.
All scans were acquired with a 16-section CT scanner (Mx8000 IDT; Philips Medical Systems, Cleveland, Ohio) by using a spiral mode with 16 x 0.75-mm collimation. The entire chest was scanned in about 10 seconds by using a caudocranial scan direction. Examinations were performed in full inspiration after appropriate instructions were given to the patients. So as to reproduce the standard situation in a screening setting, we used no spirometric control or respiratory belt. Exposure settings for the additional low-dose examinations were 30 mAs and 120 kVp (volume CT dose index [CTDIvol] = 2.2 mGy) for patients who weighed 80 kg or less and 30 mAs and 140 kVp for those who weighed more than 80 kg (CTDIvol = 3.5 mGy). Transverse images were reconstructed at a thickness of 1.0 mm and a 0.7-mm increment by using a moderately soft reconstruction kernel (kernel B), the smallest field of view that included the outer rib margins at the widest dimension of the thorax, and a 512 x 512 matrix.
Semiautomated Volume Measurements of Pulmonary Nodules
Data were transferred from the CT scanner to a digital workstation (Leonardo; Siemens Medical Solutions) with commercially available software for semiautomated volume measurements (LungCare; Siemens Medical Solutions). Nodules were identified by a single observer (H.A.G., who had 1 year of experience in radiology and who had been trained for this specific task) by using transverse thin-slab maximum intensity projections (slab thickness, 10 mm) that were displayed with window width and level settings of 1500 and –500 HU, respectively. After a candidate nodule is manually marked with a mouse click in the center of the nodule, the program automatically defines a volume of interest around the nodule that can be analyzed by using volume-rendered displays. Quantitative evaluation of a nodule is initiated with a second mouse click, which starts an automated volume measurement program that has been described previously (8). In this step, the nodule of interest is segmented and the volume of the segmented area is calculated. This segmented area is shown by a yellow overlay on the nodule. No manual interaction was performed to correct mismatches.
To minimize the influence of the separation process, which distinguishes between the nodule itself and adjacent structures, we included only lesions that had no direct contact with the pleura or vessels. That way, it was ensured that we focused solely on the precision of measurement of the volume of an isolated solid lesion. For each patient, all nodules with a volume of 15–500 mm3 that met the inclusion criteria were included for evaluation.
Nodules in the second study were identified with knowledge of findings on the first study; the nodules were matched by using the combination of section number, lung segment, and distance to the pleura.
Evaluation of Effect of Segmentation Performance
We visually determined the precision of the measurement software by assessing whether the nodule was completely segmented. Nodules were categorized into two groups (A and B) on the basis of whether the yellow overlay completely matched the nodule (group A) or whether visual assessment determined a mismatch (group B). Mismatch was defined as the visually obvious exclusion of a part of the investigated nodule from the segmented area.
Evaluation of Nodule Shape
In a separate reading session, an experienced observer (M.P., with 20 years of experience in radiology) visually assessed nodule shape and categorized nodules into three subgroups. A nodule was defined as spherical when it had a constant radius and as lobular when it had a variable radius but smooth outer margins. It was defined as irregular when the outer margins were not smooth.
To investigate the effect of nodule shape on the performance of the segmentation algorithm, we determined how many nodules were spherical, nonspherical, or irregular for completely (group A) or incompletely (group B) segmented nodules.
Evaluation of Inspiration Level and Lung Volume
To compare the inspiration levels of the two scans, we calculated the lung volume for every scan by using completely automated software developed in house. The algorithm is similar to one described previously (9). The lungs are segmented from adjacent soft-tissue structures (eg, the mediastinum, vascular structures, and the chest wall) by region-growing from an automated starting point in the trachea, including all connected areas with attenuation of less than –500 HU. In a second step, the trachea and main bronchi are excluded from the lungs. The number of voxels within the segmented lungs is multiplied by voxel size to calculate total lung volume.
Natural variation in inspiration level between the two low-dose scans was established as the ratio between the lung volume on the second scan and the lung volume on the first scan.
Statistical Evaluation
Nodule volumes are given as means ± standard deviations. Reproducibility of volume measurements was assessed by correlating nodule volumes on both scans by using the Spearman correlation coefficient for non–normally distributed populations.
Differences in volume (
V) were calculated by subtracting the volume measured on the first scan (V1) from the volume measured on the second scan (V2). These differences were plotted against the mean nodule volume by using the approach described by Bland and Altman (10):
![]() |
The differences in volume measurements were normalized with respect to mean nodule volume to assess relative differences with the following formula:
![]() |
Limits of agreement were given as 95% confidence intervals. Interscan variability was defined as the 95% confidence interval of the relative differences. An increase in nodule volume above these upper limits of agreement can, with 95% confidence, be attributed to real growth.
Because the shape of a nodule was a binary variable, an analysis of variance test to compare the variances of both groups was performed to assess the effect of nodule shape on the relative difference in measured volumes. To assess the effects of inspiration level, we performed univariate regression for both groups, with the relative difference as the dependent variable and the ratio of lung volumes as the independent variable. To assess the impact of nodule volume on measurement variation for both groups, we performed univariate linear regression with the coefficient of variation (standard deviation divided by mean volume) of both measurements as the dependent variable and the mean nodule volume as the independent variable. All statistics were calculated with software (SPSS, version 12.0; SPSS, Chicago, Ill). P values less than .05 were considered to indicate significant differences.
| RESULTS |
|---|
|
|
|---|
The mean volume of all nodules was 123.0 mm3 ± 101.9 (range, 16.4–472.7 mm3; median, 82.7 mm3). The number of eligible nodules per patient ranged from zero (no metastases visible after therapy) in four patients to 62 nodules in one patient. The median number of nodules per patient was six. None of the patients had only nodules that were larger than 500 mm3.
One hundred six nodules were spherical, 30 nodules had a nonspherical shape, and 82 nodules showed irregular margins.
Reproducibility of Volume Measurements
The reproducibility of volume measurements for the whole group was excellent, with a Spearman correlation coefficient of 0.99 (Fig 1). For the total group of nodules, the mean difference in volume measurements amounted to 2.4 mm3, ranging from –53.0 to 120.8 mm3 (95% confidence interval: –32.0, 36.7 mm3) (Fig 2a). The mean relative difference amounted to 1.3%, with a 95% confidence interval of –21.2%, 23.8% (Fig 2b).
|
|
|
|
|
|
Interscan variability was –11.9% to 12.4% for completely segmented nodules. For incompletely segmented nodules, this interval was more than twice as large: –26.8% to 30.0%. The maximum absolute difference between measured volumes on consecutive scans was 15.35 mm3 for completely segmented nodules and 120.79 mm3 for incompletely segmented ones. The maximum relative difference between volumes was 15% for completely segmented and 37% for incompletely segmented nodules. The measurement variability, as given by the width of the 95% confidence interval standard deviations, was 2.3 times smaller for completely segmented nodules than for incompletely segmented ones. This difference was statistically significant (P < .002, F test).
Ratio of lung volume between the second and the first scan ranged from 88% to 116% (Fig 4). Lung volume calculation failed in five patients because of a high number of nodules (n = 2) or fibrosis (n = 3), while lung volumes of the four patients without nodules could not be used for analysis.
|
For completely segmented nodules, the relative difference decreased with increasing ratio of inspiration level (r = –0.20, P = .047), while for incompletely segmented nodules, we found no correlation (P = .67).
| DISCUSSION |
|---|
|
|
|---|
Wormanns et al (8) applied a study design comparable to ours and used the same type of semiautomated software. Although we used thinner collimation and included a higher number of nodules, we found comparable limits of agreement: –21.2% to 23.8% in our study versus –20.6% to 21.9% in the study by Wormanns et al. Wormanns and colleagues discussed that both ill-defined shape and attachment to pleura or vessels of some of the included nodules could explain why reported variability was higher in vivo than what has been seen in phantom studies. However, they did not further analyze the contribution of these factors separately. We specifically excluded pleura-based and vessel-attached nodules to be able to assess these factors separately. The reason we still found similar limits of agreement despite these exclusion criteria is likely to be due to the higher number of irregularly shaped nodules in our study than in the group of nodules analyzed by Wormanns et al.
Although LungCare uses a global thresholding method for the segmentation of a candidate nodule, the algorithm reported by Kostis et al (12) uses a more sophisticated segmentation approach that was developed to deal with irregular shapes. With their algorithm, Kostis et al also found an effect of the initial nodule size on the extent of variability, while Wormanns et al had reported similar limits of agreement for different size ranges of nodules. We could not prove a significant effect of lesion size on the variability of measurements for nodules that had been perfectly segmented. One could argue that the precision of the segmentation algorithm could have been improved by manual modification. Our software offers the possibility of adapting the segmented area by modifying the cutoff value of the cross-section curves. The means of that adaptation process, however, are limited with respect to the fact that modification of the cutoff value is only valid for the complete circumference of the nodule. As a consequence, moving the cutoff value of the cross-section curve to the right (resulting in increased overlaid volume) helps to include parts of the nodules that were originally not included, but at the expense of including surrounding parts, leading to an overestimation of the volume. Other important aspects are that the use of any manual adaptation or modification of the volumetric measurement would not only increase the radiologist's reading time but would also introduce another variable.
Changes in inspiratory level turned out to be another source of variability. We found that higher inspiration level led to a decrease in measured volume. This finding is most likely due to the fact that the attenuation of the surrounding lung parenchyma changes with inspiratory level, leading to an alteration of the cutoff value and thus the volume assessment. Although the contribution of inspiratory level was statistically significant, the quantities of volume changes introduced by it were only small.
Other potential sources of variability are interobserver and intraobserver variability. Because in our study, the same single observer measured all nodules, interobserver variability did not contribute to the variability of our measurements. We did not specifically evaluate intraobserver variability, but Wormanns and colleagues have already shown that the effect of intraobserver variability is negligible compared with the effect of interscan variability (8).
The main limitation of our study was that all results reported are valid only for the particular software release we used. Although this program is widely used, our results are not transferable to other algorithms. The limitation we found is related to the fact that a global thresholding method is used for the segmentation of a candidate nodule. Algorithms that use a more sophisticated segmentation algorithm are likely to also achieve a higher precision for irregularly shaped nodules.
In conclusion, the precision of the commercially available software tool tested was dependent on lung nodule morphology and was found to be less reproducible for nonspherical than for spherical solid nodules. The extent of variability decreased with increasing nodule size and higher inspiration level.
Taking the reported variation into account, the threshold for calling an increased measured volume of a lung nodule a real volume increase with 95% confidence lies at a 30% increase for an irregularly shaped lesion. For spherical nodules, this threshold can be lowered to 15%.
| ADVANCES IN KNOWLEDGE |
|---|
|
|
|---|
| IMPLICATION FOR PATIENT CARE |
|---|
|
|
|---|
| FOOTNOTES |
|---|
Authors stated no financial relationship to disclose.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. J. Nietert, J. G. Ravenel, W. M. Leue, J. V. Miller, K. K. Taylor, E. S. Garrett-Mayer, and G. A. Silvestri Imprecision in Automated Volume Measurements of Pulmonary Nodules and Its Effect on the Level of Uncertainty in Volume Doubling Time Estimation Chest, June 1, 2009; 135(6): 1580 - 1587. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Rampinelli, E. De Fiori, S. Raimondi, G. Veronesi, and M. Bellomi In Vivo Repeatability of Automated Volume Calculations of Small Pulmonary Nodules With CT Am. J. Roentgenol., June 1, 2009; 192(6): 1657 - 1661. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Marchiano, E. Calabro, E. Civelli, G. Di Tolla, L. F. Frigerio, C. Morosi, F. Tafaro, E. Ferri, N. Sverzellati, T. Camerini, et al. Pulmonary Nodules: Volume Repeatability at Multidetector CT Lung Cancer Screening Radiology, June 1, 2009; 251(3): 919 - 925. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Gavrielides, L. M. Kinnard, K. J. Myers, and N. Petrick Noncalcified Lung Nodules: Volumetric Assessment with Thoracic CT Radiology, April 1, 2009; 251(1): 26 - 37. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Hochhegger, A. A. Bankier, C. R. O'Donnell, and P. M. Boiselle Re: Respiratory Instructions for CT Examinations of the Lungs RadioGraphics, March 1, 2009; 29(2): 637 - 638. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |