Radiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


DOI: 10.1148/radiol.2343031009
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hayashino, Y.
Right arrow Articles by Fukui, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hayashino, Y.
Right arrow Articles by Fukui, T.
(Radiology 2005;234:740-748.)
© RSNA, 2005


Evidence-based Practice

Ventilation-Perfusion Scanning and Helical CT in Suspected Pulmonary Embolism: Meta-Analysis of Diagnostic Performance1

Yasuaki Hayashino, MD, MPH, Masashi Goto, MD, MPH, Yoshinori Noguchi, MD, MPH and Tsuguya Fukui, MD, MPH

1 From the Department of General Medicine and Clinical Epidemiology, Kyoto University Graduate School of Medicine, Kyoto, Japan. Received June 26, 2003; revision requested September 9; final revision received May 21, 2004; accepted June 23. Address correspondence to T.F. Department of Medicine, St Luke’s International Hospital, 9–1 Akashi-cho, Chuo-ku, Tokyo, 104-8560, Japan. (e-mail: fkts@luke.or.jp).


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
PURPOSE: To perform meta-analysis of literature about the role of helical computed tomography (CT) and ventilation-perfusion (V-P) scanning in detection of acute pulmonary embolism (PE) by using summary receiver operating characteristic (ROC) curve analysis.

MATERIALS AND METHODS: V-P scanning articles published from January 1985 to March 2003 and helical CT articles published from January 1990 to March 2003 in MEDLINE and EMBASE databases were included if (a) tests were performed for evaluation of acute PE, (b) conventional angiography was the reference standard, and (c) absolute numbers of true-positive, false-negative, true-negative, and false-positive results were available. Sensitivity analysis was conducted by excluding articles published before 1995.

RESULTS: A total of 12 articles discussing helical CT and/or V-P scanning were included. With a random-effects model, pooled sensitivity for helical CT was 86.0% (95% confidence interval [CI]: 80.2%, 92.1%), and specificity was 93.7% (95% CI: 91.1%, 96.3%). V-P scanning yielded low sensitivity of 39.0% (95% CI: 37.3%, 40.8%) but high specificity of 97.1% (95% CI: 96.0%, 98.3%) with high probability threshold. V-P scanning yielded high sensitivity of 98.3% (95% CI: 97.2%, 99.5%) and low specificity of 4.8% (95% CI: 4.7%, 4.9%) with normal threshold. Regression coefficients for helical CT angiography were 0.588 (95% CI: –1.55, 2.74) and 4.14 (95% CI: –0.002, 8.28) versus V-P scanning with high and normal thresholds, respectively. Regression coefficients for helical CT angiography were 0.588 (95% CI: –1.55, 2.74) and 4.14 (95% CI: –0.002, 8.28) versus V-P scanning with high and normal thresholds, respectively.

CONCLUSION: Helical CT has greater discriminatory power than V-P scanning with normal and/or near-normal threshold to exclude PE, while helical CT and V-P scanning with high probability threshold had similar discriminatory power in the diagnosis of PE.

© RSNA, 2005


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
The diagnosis of pulmonary embolism (PE) remains difficult because the symptoms are not specific and because all available tests have substantial limitations in clinical settings (1,2). Selective pulmonary angiography remains the reference standard; however, this procedure is invasive and causes morbidity and mortality rates of 4% and 0.2%, respectively (3,4). Moreover, it is costly and time-consuming. Several minimally invasive modalities have been used to facilitate detection of PE. Because of its general availability, ventilation-perfusion (V-P) scanning is one of the most frequently used procedures in the confirmation or exclusion of PE. The Prospective Investigation of Pulmonary Embolism Diagnosis (PIOPED) study (5) assessed the usefulness of V-P lung scanning in patients with acute PE. Despite accuracy rates as high as 96% in some instances, V-P scanning alone is inadequateto confirm or exclude diagnosis of PE because up to 75% of patients are in the inconclusive category; therefore, other modalities, such as serial compression ultrasonography and D-dimer testing, are useful adjuncts (6,7).

Contrast agent–enhanced helical computed tomography (CT) of the pulmonary arteries has been proposed, and data are accumulating (817). The choice between V-P scanning and helical CT should be determined by using available data to compare diagnostic accuracy. Interpretation of the data as a whole is difficult because of the wide variation in the background of the patients. In prior reports that compare the test performance of helical CT and V-P scanning (18,19), the difference in test performance has not been systematically compared with a statistical method frequently used in meta-analysis of diagnostic tests.

Summary receiver operating characteristic (ROC) analysis is a method that enables quantitative combination of the multiple studies with heterogeneous results; each point on the summary ROC curve represents a combination of sensitivity and specificity that could result from each study (2022). Heterogeneity among different studies is caused either by differences between the way clinicians define a test as positive for PE or by wide variation in terms of the patients’ background; summary ROC curve analysis could resolve those problems by means of stringent inclusion criteria and meta-regression analysis. It is suitable to compare the performance of different diagnostic tests, and reports involving the use of this analysis have been accumulating (23,24). Thus, the purpose of our study was to perform meta-analysis of the helical CT and V-P scanning literature by using the methodologic tool of summary ROC analysis.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Identification of Trials
A computerized search was performed to identify relevant English-language articles published in MEDLINE. To search for articles about helical CT, we used "computed tomography" in the text, "image interpretation, computer-assisted" in medical subject headings, and "pulmonary embolism" in medical subject headings or text. Similarly, we searched for articles about V-P scanning by using "radionuclide imaging" in medical subject headings and "perfusion scan" or "ventilation-perfusion lung scan" in the text. The search for helical CT articles was limited to articles published between January 1, 1990, and May 30, 2003, since earlier CT equipment was substantially different. We included articles on V-P scanning published from January 1, 1985, to May 30, 2003, since this technique was used in the diagnosis of PE beginning in the mid-1980s. We also scanned references in retrieved articles and contacted individuals who are knowledgeable in radiology to see if they knew of any other relevant reports. A database search with a similar strategy was conducted in the EMBASE database for articles published between 1985 and 2003. We did not search for unpublished reports.

Inclusion Criteria
We included a study if (a) helical CT or V-P scanning was used as a diagnostic tool for acute PE; (b) absolute numbers of true-positive, false-positive, true-negative, and false-negative cases or their equivalent were given; (c) pulmonary angiography was used as the reference standard for diagnosis of PE; and (d) the time interval between the findings obtained from the test and reference standard was 48 hours or less, taking into account the fact that PE might disappear during the interval of two tests. We determined this time interval by reviewing literature of diagnostic tests in patients with PE (25,26) and discussing this issue with radiologists.

A study was excluded if (a) pulmonary angiography, in combination with any other modality, served as the reference standard; (b) helical CT was not performed for acute PE (eg, chronic PE or septic embolism); (c) noncomparable CT methods (eg, electron-beam CT) were used; (d) helical CT was performed after anticoagulant therapy or surgery for PE; or (e) the published information was incomplete. We tried to see if different studies from the same institution used the same patients because one author published several reports.

Data Collection
Two investigators (Y.H., M.G.) independently abstracted the data from all articles included in our analysis. The information abstracted included descriptive data (eg, authors, title, journal citation, and year of publication), study group characteristics (eg, sample size, mean age, proportion of women, and prevalence of PE), study design characteristics (involving criteria used to define a positive result and protocol information), extent of blinding between readers, information about the extent of the disease, and any evidence of verification bias and test interpretation bias.

For each study, the results were classified as true-positive, false-positive, true-negative, and false-negative. For V-P scanning, PIOPED criteria are generally used according to the probability of PE (eg, high, intermediate, low, or near normal or normal) (5). We specify three criteria for calculating sensitivity and specificity (Table 1). High-probability V-P scanning findings were positive, and others (eg, intermediate probability, low probability, or near normal and/or normal) were negative (threshold 1). High- and intermediate-probability V-P scanning findings were positive, and low-probability and normal and/or near-normal V-P scanning findings were negative (threshold 2). Normal and/or near-normal V-P scanning findings were negative, and others (eg, high probability, intermediate probability, and low probability) were positive (threshold 3). In one study, researchers developed and used original criteria that consisted of five categories; these five categories were reduced to four to match PIOPED criteria (27). As for helical CT, the presence or absence of PE, defined as an intraluminal filling defect or complete nonfilling of a pulmonary artery, was used as a criterion for determining positive or negative status.


View this table:
[in this window]
[in a new window]

 
TABLE 1. Comparison between Thresholds and Probabilities according to PIOPED Criteria

 
From the articles in which investigators tabulated the results for different observers, we extracted data for the first observer, unless one observer was emphasized in the literature. When authors emphasized one result (eg, level of experience) in different observers, we extracted the most emphasized results. Any inconsistencies or controversies encountered in abstracted data were resolved with discussion and consensus.

Analysis and Statistics
The overall suitability of the pooled and summary ROC curve analysis was evaluated by using the Spearman correlation coefficient (22). We then checked heterogeneity separately for sensitivity and specificity. Since sensitivities for helical CT and specificities for V-P scanning threshold 3 were not homogeneous (P = .006 and P < .001, respectively), pooled sensitivity and specificity estimates were calculated by using a random-effects model that weighted each report according to its sample size (28).

To estimate the summary ROC curve for helical CT and V-P scanning, we used a previously described method of variance-weighted least squares regression (20,22,23,28). On the basis of the 2 x 2 table constructed from each report, we made a logit transformation of the true-positive (eg, sensitivity) and false-positive (eg, 1 minus specificity) rates. Differences in the logit transformations (eg, measure of the observed discriminatory power of helical CT and V-P scanning) were then regressed on the sums of the logit transformations (eg, measure of the positivity threshold used to determine positive helical CT and V-P scanning results). Summary ROC curves for helical CT and V-P scanning were constructed with back transformation of the fitted line from the regression model. We weighted each study in the regression model by its variance with the following equation: [1/(true-positive + 0.5)] + [1/(false-positive + 0.5)] + [1/(false-negative + 0.5)] + [1/(true-negative + 0.5)] (20). We restricted the final summary ROC curves to the range of observed true-positive and false-positive rates.

Adjustment for clinical variables was accomplished by including them in the regression model. Inclusion of a dummy variable in the regression analysis for the type of diagnostic examination performed (eg, 1 for helical CT and 0 for V-P scanning) allows comparison of tests. The regression coefficient of this dummy variable is a measure of the difference in discriminatory power between the examinations. A positive regression coefficient implies increased discriminatory power for helical CT compared with V-P scanning, and a negative regression coefficient implies reduced discriminatory power. To avoid undefined values for diagnostic odds ratio, positivity criteria, and the variance that arises from zeroes of the true-positive, false-negative, true-negative, or false-positive values, 0.5 was added to that value (20).

We assessed the effect of publication year, mean age (55 years or younger vs older than 55 years), prevalence of PE, duration of tests (<24 hours vs <48 hours), study design (prospective vs retrospective), presence of interpretation bias, and presence of verification bias (eg, presence of verification bias vs no available information) in a combined model of helical CT and V-P scanning. We could not consider the effect of the extent of the disease in the model used to compare helical CT and V-P scanning, since information about the extent of the disease was not available in the literature that mainly dealt with V-P scanning (5,27,29).

We dichotomized some variables (eg, age, percentage of women, and duration between tests) at median. Because of the availability of data (eg, data on collimation were only available for helical CT) or missing data, the following variables were analyzed separately in each model: percentage of women included in the study (eg, ≤25% vs >25%), collimation (eg, 3 mm or thinner vs thicker than 3 mm), size of PE (eg, segmental vs subsegmental) for helical CT model, and type of radionuclide (eg, technetium 99m [99mTc] diethylenetriaminepentaacetic acid [DTPA] vs other types of radionuclides) used for V-P scanning. In the combined model, univariate analysis was performed to enable the effect of each clinical covariate to be assessed.

We added the factors that had a P value of less than .20 at univariate analysis into a multivariate regression model and used backward elimination to remove variables with a P value of more than .05. For the main aim of this study, a dummy variable for the type of diagnostic test (helical CT = 1) was always kept in this process. In separate and combined models, V-P scanning data were treated separately in different circumstances (eg, helical CT vs V-P scanning threshold 1, helical CT vs V-P scanning threshold 2, and helical CT vs V-P scanning threshold 3). Finally, we reanalyzed the final model with random-effects regression analysis (Technical bulletin no. 42; Stata Statistical Software, College Station, Tex), which took inter- and intrastudy variability into account.

After sensitivities and specificities were pooled, we assessed the posttest probability of PE on the basis of different pretest probabilities (low = .03, moderate = .27, high = .78). The arbitrary pretest probabilities of .03, .27, and .78 were based on the report by Wells et al (1), in which pretest probability was determined by using clinical signs and symptoms. First, pretest odds were converted into posttest odds by multiplying the pretest odds by the likelihood ratio. Likelihood ratio is defined as the probability of the test result in people with the disease divided by the probability of the test result in people without the disease. Posttest odds were converted back to posttest probabilities (30).

Since the helical CT method used in the detection of PE has undergone rapid changes in the past decade, the test performance might have changed from the early 1990s to 2003. Thus, we performed a sensitivity analysis by excluding helical CT articles that were published before 1995 and compared the results with those in base-case analysis. All analyses were performed by using commercially available software (Intercooled Stata 7.0; StataCorp, College Station, Tex).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Summary of the Literature Review and Data Extraction
Our initial data search yielded a total of 1385 titles of studies that used helical CT or V-P scanning. We excluded 1306 articles by reviewing titles and abstracts and selected 79 possible articles for our analysis; the reasons for exclusion are summarized in Figure 1. Of these 79 articles, we identified 12 that met all the inclusion criteria (5,811,1317,27,29). Two of these articles also reported test performance of both helical CT and V-P scanning (10,14); thus, we had nine articles for helical CT and five articles for V-P scanning. Two studies were reported by the same lead author (Remy-Jardin) (8,11), but these studies were judged not to be overlapped because of the different study periods and different amount of contrast agent used. Overall, there were discrepancies between the two authors in 19 (13.6%) of 140 extracted items, ranging from 0% to 35.7% depending on the sort of items extracted. All discrepancies were resolved by consensus.



View larger version (38K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1. Flow chart shows results of literature search. hCT = helical CT, PA = pulmonary angiography.

 
Helical CT and V-P Scanning
Helical CT was used to aid diagnosis of PE for the first time in 1992 (8). Of 12 articles we identified, nine collectively reported findings in 520 subjects (range, 10–151 subjects per study) who underwent helical CT (811,1317), while five reported findings in 1269 subjects (range, 20–731 subjects) who underwent V-P scanning (5,10,14,27,29).

Variations in study protocols included thickness of the scanning section (range, 2.5–5.0 mm) and the amount of contrast agent (range, 70–150 mL). As for detector system, single–detector row helical CT was used in eight studies, and dual–detector row helical CT was used in one study (17). The reported sensitivity of helical CT ranged from 53% (13) to 100% (8,9), and specificity ranged from 75% (17) to 100% (9,11,14).

Ventilation studies used xenon 133 gas (133Xe), 99mTc-pyrophosphate (PYP), or 99mTc-DTPA as the nuclear isotope, and perfusion studies used 99mTc-macroaggregated albumin (MAA) as the nuclear isotope. PIOPED criteria were used in four studies, and original criteria were formulated in one study (27). The reported specificity of V-P scanning with threshold 1 ranged from 96.0% (5) to 100% (10,14). With threshold 2, sensitivity ranged from 54.5% (10) to 100% (14), and with threshold 3, sensitivity ranged from 98% (5) to 100% (10,14,27,29). Details of the articles included are summarized in Table 2.


View this table:
[in this window]
[in a new window]

 
TABLE 2. Clinical Characteristics of Studies Included in Meta-Analysis

 
Weighted Pooled Results
Weighted pooled data are presented in Table 3. Compared with helical CT, V-P scanning yielded a sensitivity of 39.0% (95% CI: 37.3%, 40.8%) with threshold 1, while it yielded sensitivities of 86.0% (95% CI: 83.3%, 88.8%) and 98.4% (95% CI: 97.2, 99.5) with thresholds 2 and 3, respectively. V-P scanning yielded a specificity of 97.1% (95% CI: 96.0%, 98.3%) with threshold 1 and a sensitivity of 39.0% (95% CI: 37.3%, 40.8%) and 4.8% (95% CI: 4.7%, 4.9%) with thresholds 2 and 3, respectively.


View this table:
[in this window]
[in a new window]

 
TABLE 3. Pooled Sensitivity and Specificity of Helical CT and V-P Scanning with Different Thresholds

 
Summary ROC Analysis
No significant predictors were found in the separate univariate analysis for helical CT (eg, collimation or size of PE) or V-P scanning (eg, type of radionuclide used for V-P scanning). Univariate analysis for the comparison of two tests revealed that the following variables had a P value of less than .20: age in the model (P = .032), including V-P scanning threshold 1; duration between two tests (P = .027) and presence of verification bias (P = .067) in the model, including V-P scanning threshold 2; and duration between two tests (P = .111) and presence of verification bias (P = .146) in the model, including V-P scanning threshold 3. Thus, we included these variables in each multivariate model; however, as a result of backward elimination, no significant predictors were kept in the final model. In a final model used to compare helical CT and V-P scanning threshold 2, helical CT displayed superior discriminatory power, with a ß coefficient of 3.73 (95% CI: 2.56, 4.9). In a model used to compare performance of helical CT with that of V-P scanning threshold 3, the estimate was large enough to suggest that helical CT was superior to V-P scanning, although this difference was not statistically significant (ß coefficient, 4.14; P = .05). In the comparison of helical CT and V-P scanning threshold 1, no significant difference was observed in terms of discriminatory power (ß coefficient, 0.588; P = .457). These results are shown in Table 4 and Figure 2.


View this table:
[in this window]
[in a new window]

 
TABLE 4. Comparison of Summary ROC Analyses between Helical CT and V-P Scanning

 


View larger version (23K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2a. (a) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 1. (b) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 2. (c) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 3. (a-c) Horizontal axis represents the false-positive ratio (1 minus specificity), and the vertical axis represents the true-positive ratio (sensitivity). The size of a plotting symbol is inversely proportional to the variance of an observation. Helical CT has a discriminatory power similar to that of V-P scanning at thresholds 1 and 3, while overall discriminatory power of helical CT is much better than that of V-P scanning at threshold 2.

 


View larger version (19K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2b. (a) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 1. (b) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 2. (c) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 3. (a-c) Horizontal axis represents the false-positive ratio (1 minus specificity), and the vertical axis represents the true-positive ratio (sensitivity). The size of a plotting symbol is inversely proportional to the variance of an observation. Helical CT has a discriminatory power similar to that of V-P scanning at thresholds 1 and 3, while overall discriminatory power of helical CT is much better than that of V-P scanning at threshold 2.

 


View larger version (17K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2c. (a) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 1. (b) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 2. (c) Summary ROC curves for helical CT constructed with a model used to compare V-P scans by using threshold 3. (a-c) Horizontal axis represents the false-positive ratio (1 minus specificity), and the vertical axis represents the true-positive ratio (sensitivity). The size of a plotting symbol is inversely proportional to the variance of an observation. Helical CT has a discriminatory power similar to that of V-P scanning at thresholds 1 and 3, while overall discriminatory power of helical CT is much better than that of V-P scanning at threshold 2.

 
Posttest Probability of PE
Posttest probabilities of PE according to positive or negative results of each test are summarized in Table 5. When pretest probabilities were moderate (.27) or high (.78), posttest probabilities of positive results of V-P scanning with threshold 1 and helical CT were .841 and .98, respectively. When pretest probability was low (.03), posttest probability of negative helical CT was .005, and that of negative V-P scanning with threshold 3 was .01. When pretest probabilities were moderate or high, posttest probabilities of negative helical CT and V-P scanning were not low enough to exclude a diagnosis of PE.


View this table:
[in this window]
[in a new window]

 
TABLE 5. Posttest Probabilities of PE with Different Pretest Probabilities and Test Results

 
Sensitivity Analysis
Results of sensitivity analysis are shown in Tables 3 and 4. By excluding articles published before 1995, pooled sensitivity (true-positive ratio) decreased from 86.0% to 84.6% (95% CI: 78.3%, 91.6%), while specificity did not change substantially (93.7%). Summary ROC analysis yielded a similar ß coefficient (0.588) and a slightly increased P value (.118).


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
In the present study, meta-analysis of the contemporary helical CT and V-P scanning literature was performed. On the basis of summary ROC analysis, helical CT and V-P scanning had similar discriminatory power when the high-probability threshold was used. The data also suggest that helical CT has greater discriminatory power than V-P scanning when the intermediate-probability threshold was negative. The results of the model comparing helical CT and V-P scanning with normal and/or near normal threshold is confusing because the ß coefficient is large, but it borders on being statistically significant (P = .05). As will be discussed later, few articles were included in this meta-analysis because of stringent inclusion criteria, which might have decreased our power to detect real difference of two diagnostic tests. To evaluate these issues, it is crucial to take a quantitative view of the data and its interpretation—that is, it is important to interpret the result in terms of effect size rather than in terms of testing (P value) (31). Thus, we would rather state that ß coefficient of 4.14 is high enough to suggest that helical CT has higher discriminatory power than V-P scanning with use of the normal and/or near-normal threshold.

In two studies (10,14), the diagnostic test performance of both helical CT and V-P scanning were reported in one article, but it was not plausible to directly compare these two modalities for several reasons. In one of these studies, not all patients underwent both tests, and data were insufficient for direct comparison (14). The only study we could use for direct comparison was one in which all patients underwent both helical CT and V-P scanning (10). It might be interesting to directly compare the performance of helical CT and V-P scanning; however, the main aim of the second study was not to compare the performance of the two tests, and there exists a verification bias (eg, patients underwent helical CT on the basis of results of V-P scanning). The direct comparison influenced by verification bias might have caused deviation in the performance of the latter test; thus, it was rather misleading to compare the two tests in this study (23).

Statistical Methods and Results
The summary ROC analysis allows us to compare results of different tests by summarizing sensitivity and specificity results from several studies into a single ROC curve (21). In the past decade, many articles have suggested that helical CT might be more effective than V-P scanning in the diagnosis of PE (32,33), but formal comparison of the two diagnostic tests has yet to be performed. Although previous reports have described helical CT as superior to V-P screening because of its higher sensitivity and specificity, one cannot state that helical CT is more useful for this reason alone. In general, a negative result essentially rules out PE when a test with very high sensitivity is used, and a positive result effectively confirms diagnosis of PE when a test with very high specificity is used (30). In the clinical setting, V-P scanning with a high-probability threshold and high specificity has been used to confirm a diagnosis of PE, and V-P scanning with a normal and/or near normal threshold with high sensitivity has been used to exclude a diagnosis of PE. It is therefore necessary to compare the sensitivity of helical CT with that of V-P scanning by using the normal and/or near-normal threshold to exclude the possibility of PE; it is also necessary to compare specificity to diagnose PE with summary ROC analysis.

There has been criticism of the sensitivities and specificities claimed in earlier helical CT reports (34,35). New diagnostic tests are often described in glowing terms when they are first introduced; however, these tests are often found to be wanting when more experience has been gained (36). This frequently results from limitations in the methods used to evaluate test characteristics. An example of this phenomenon is use of carcinoembryonic antigen (30). Carcinoembryonic antigen was originally considered a very promising tool in the diagnosis of colon cancer; however, carcinoembryonic antigen level was subsequently found to be increased in a wide variety of instances, including in smokers without cancer. The same seems true for helical CT used in the diagnosis of PE. When compared with earlier studies that showed high sensitivity (90%–100%) and specificity (96%–100%), the test performance was lower (sensitivity, 70%; specificity, 91%) in a study performed in 2001 (35). On the other hand, rapid advances in the CT method used in the diagnosis of PE have been made in the past decade. Thus, we performed sensitivity analysis, and the results of ROC analysis did not change when articles published before 1995 were excluded. Moreover, univariate analysis reveals that publication year is not statistically significant.

Another criticism of high accuracy rates, as pointed out in recently published review articles, is that methodologic problems are common in studies used to evaluate helical CT in the diagnosis of PE (eg, several reports were missing key data regarding the methods used to select patients). It is unclear, however, how these methodologic problems have influenced our results (26,37). The second PIOPED study, which is being funded by the U.S. National Institutes of Health, involves evaluation of the accuracy of helical CT in the diagnosis of PE in more than 1000 patients and should bring us closer to the solution of these methodologic problems (38).

Clinical Implications and Results
Our work shows that when the high probability threshold is used, helical CT and V-P scanning have comparable discriminatory power; however, when the normal and/or near-normal threshold is used, helical CT had greater discriminatory power than V-P scanning. It is now more important to consider other aspects of the tests. First, the main problem with V-P scanning is that definitive diagnosis can be obtained in less than 30% of patients tested, and the remaining patients need to undergo further testing (5). The use of helical CT would therefore reduce the number of patients subjected to further diagnostic tests (33). In contrast, one should consider the presence of contraindications before performing helical CT. In one report, about 24% of patients suspected of having PE did not undergo helical CT because of contraindications, such as impaired renal function or allergy to contrast agent (35). This is a substantial proportion of patients and is similar to the proportion of patients with inconclusive results of V-P scanning.

Second, as for V-P scanning, large differences (25%–30%) of interpretation among expert readers have been reported, especially in the classification of low- or intermediate-probability scans. In contrast, helical CT has better inter-observer agreement than does V-P scanning ({kappa} value of 0.85 and 0.61, respectively) (12). Third, there are inconsistent results concerning relative cost-effectiveness, with the controversy continuing (32,34). The advantages or disadvantages of either test are crucial in application to the patients when PE is suspected. It is important to judge the advantages and disadvantages of each test and select the most appropriate procedure for a favorable outcome. On the basis of the results of our analyses, we recommend the following strategies: Confirm PE with moderate to high pretest probability, and use either helical CT or V-P scanning, according to high-probability threshold. Exclude PE with low pretest probability, as helical CT is a better test than V-P scanning. If helical CT is not available, V-P scanning with normal and/or near-normal threshold could be an alternative technique. To exclude PE with moderate or high pretest probability or to confirm PE with low pretest probability, avoid V-P scanning by using low-probability threshold.

Limitations
Our review has several limitations. First, because of the nature of meta-analysis, the result is subject to publication bias. Only published reports were examined, and studies with poor results are less likely to be written, submitted, and accepted. Our results may therefore be biased toward the favorable direction. This tendency should affect helical CT and V-P scanning equally and should not alter our qualitative conclusion. Second, as in all meta-analyses of diagnostic testing, verification bias could be present, since about half of the studies included did not control or mention verification bias. Verification bias occurs when the result of the test influences the decision as to which patients receive the verification test. This can have dramatic results on the sensitivity and specificity of a test (39). We were unable to correct for this bias because the original studies did not provide the necessary information on the entire population tested; in our study, however, covariate analysis did not show a significant difference between studies that did and those that did not control for verification bias. Third, the large degree of variation between observers in reading V-P scans could limit the interpretation of our results when combining studies; this possibility was rejected with a test for homogeneity.

Numerous studies were excluded from analysis, and a smaller number of studies was finally included in meta-analysis compared with previous meta-analysis of helical CT (19). One reason is that some studies used combination reference standards to compare helical CT and V-P scanning (eg, normal results at V-P lung scanning were accepted as an alternative reference standard for the absence of PE) (12,33). The small number of studies included in the current analysis might have decreased our power to detect the true difference. This could not have been avoided, however, because it is usually assumed that the test is being compared with a sole reference standard when meta-analysis of diagnostic tests is conducted. We therefore chose the optimal strategy that pulmonary angiography should be the sole reference standard. For the same reason as mentioned previously, only one multi–detector row helical CT report was included in our analysis. We might have underestimated the test performance of helical CT because the recent introduction of multi–detector row helical CT is expected to offer a further increase in performance, particularly in the ability of physicians to scan larger anatomic volumes with high spatial resolution (40). The stringent inclusion criteria used in the current study, however, should have ensured the quality of our results.

Independently pooled estimates of sensitivity and specificity could be calculated easily; however, these frequently used methods have come under strong criticism because they do not take into account the fact that different studies may have used test thresholds (41). In spite of this, the reason why we pooled estimates of sensitivity and specificity separately is that the results of ß coefficients are not always easy for readers to intuitively understand. In a real-world setting, reports of summary ROC analysis present the results of pooled sensitivities; specificities are presented in reports of summary ROC analysis (42,43). Another reason we pooled estimates of sensitivity and specificity separately is that authorities in the field of decision sciences recommend to pool sensitivities and specificities of diagnostic tests and use these data for cost-effectiveness analysis (44). These pooled estimates, however, should carefully be interpreted when used to compare these two diagnostic tests directly.

In conclusion, helical CT has greater discriminatory power than V-P scanning with the normal and/or near-normal threshold in the exclusion of PE, while helical CT and V-P scanning with high-probability threshold had similar discriminatory power in the diagnosis of PE.


    FOOTNOTES
 
Abbreviations: CI = confidence interval, MAA = macroaggregated albumin, PE = pulmonary embolism, PIOPED = Prospective Investigation of Pulmonary Embolism Diagnosis, PYP = pyrophosphate, ROC = receiving operator characteristic, V-P = ventilation-perfusion

Authors stated no financial relationship to disclose.

Author contributions: Guarantor of integrity of entire study, Y.H.; study concepts and design, all authors; literature research, Y.H., M.G.; data acquisition, Y.H., G.M.; data analysis/interpretation, all authors; statistical analysis, Y.H., Y.N.; manuscript preparation and definition of intellectual content, all authors; manuscript editing and revision/review, Y.H., Y.N., T.F.; manuscript final version approval, all authors


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Wells PS, Ginsberg JS, Anderson DR, et al. Use of a clinical model for safe management of patients with suspected pulmonary embolism. Ann Intern Med 1998; 129:997-1005.[Abstract/Free Full Text]
  2. Hyers TM. Venous thromboembolism. Am J Respir Crit Care Med 1999; 159:1-14.[Free Full Text]
  3. Mills SR, Jackson DC, Older RA, Heaston DK, Moore AV. The incidence, etiologies, and avoidance of complications of pulmonary angiography in a large series. Radiology 1980; 136:295-299.[Abstract/Free Full Text]
  4. Stein PD, Athanasoulis C, Alavi A, et al. Complications and validity of pulmonary angiography in acute pulmonary embolism. Circulation 1992; 85:462-468.[Abstract/Free Full Text]
  5. The PIOPED Investigators. Value of the ventilation/perfusion scan in acute pulmonary embolism: results of the prospective investigation of pulmonary embolism diagnosis (PIOPED). JAMA 1990; 263:2753-2759.[Abstract/Free Full Text]
  6. Stein PD, Hull RD, Raskob GE. Withholding treatment in patients with acute pulmonary embolism who have a high risk of bleeding and negative serial noninvasive leg tests. Am J Med 2000; 109:301-306.[CrossRef][Medline]
  7. Righini M, Goehring C, Bounameaux H, Perrier A. Effects of age on the performance of common diagnostic tests for pulmonary embolism. Am J Med 2000; 109:357-361.[CrossRef][Medline]
  8. Remy-Jardin M, Remy J, Wattinne L, Giraud F. Central pulmonary thromboembolism: diagnosis with spiral volumetric CT with the single-breath-hold technique—comparison with pulmonary angiography. Radiology 1992; 185:381-387.[Abstract/Free Full Text]
  9. Blum AG, Delfau F, Grignon B, et al. Spiral-computed tomography versus pulmonary angiography in the diagnosis of acute massive pulmonary embolism. Am J Cardiol 1994; 74:96-98.[CrossRef][Medline]
  10. Goodman LR, Curtin JJ, Mewissen MW, et al. Detection of pulmonary embolism in patients with unresolved clinical and scintigraphic diagnosis: helical CT versus angiography. AJR Am J Roentgenol 1995; 164:1369-1374.[Abstract/Free Full Text]
  11. Remy-Jardin M, Remy J, Deschildre F, et al. Diagnosis of pulmonary embolism with spiral CT: comparison with pulmonary angiography and scintigraphy. Radiology 1996; 200:699-706.[Abstract/Free Full Text]
  12. Mayo JR, Remy-Jardin M, Muller NL, et al. Pulmonary embolism: prospective comparison of spiral CT with ventilation-perfusion scintigraphy. Radiology 1997; 205:447-452.[Abstract/Free Full Text]
  13. Drucker EA, Rivitz SM, Shepard JA, et al. Acute pulmonary embolism: assessment of helical CT for diagnosis. Radiology 1998; 209:235-241.[Abstract/Free Full Text]
  14. Garg K, Welsh CH, Feyerabend AJ, et al. Pulmonary embolism: diagnosis with spiral CT and ventilation-perfusion scanning—correlation with pulmonary angiographic results or clinical outcome. Radiology 1998; 208:201-208.[Abstract/Free Full Text]
  15. Qanadli SD, Hajjam ME, Mesurolle B, et al. Pulmonary embolism detection: prospective evaluation of dual-section helical CT versus selective pulmonary arteriography in 157 patients. Radiology 2000; 217:447-455.[Abstract/Free Full Text]
  16. Ruiz Y, Caballero P, Caniego JL, et al. Prospective comparison of helical CT with angiography in pulmonary embolism: global and selective vascular territory analysis—interobserver agreement. Eur Radiol 2003; 13:823-829.[Medline]
  17. Nilsson T, Soderberg M, Lundqvist G, et al. A comparison of spiral computed tomography and latex agglutination D-dimer assay in acute pulmonary embolism using pulmonary arteriography as gold standard. Scand Cardiovasc J 2002; 36:373-377.[CrossRef][Medline]
  18. van Beek EJ, Brouwers EM, Song B, Bongaerts AH, Oudkerk M. Lung scintigraphy and helical computed tomography for the diagnosis of pulmonary embolism: a meta-analysis. Clin Appl Thromb Hemost 2001; 7:87-92.
  19. Safriel Y, Zinn H. CT pulmonary angiography in the detection of pulmonary emboli: a meta-analysis of sensitivities and specificities. Clin Imaging 2002; 26:101-105.[CrossRef][Medline]
  20. Littenberg B, Moses LE. Estimating diagnostic accuracy from multiple conflicting reports: a new meta-analytic method. Med Decis Making 1993; 13:313-321.
  21. Moses LE, Shapiro D, Littenberg B. Combining independent studies of a diagnostic test into a summary ROC curve: data-analytic approaches and some additional considerations. Stat Med 1993; 12:1293-1316.[Medline]
  22. Irwig L, Tosteson AN, Gatsonis C, et al. Guidelines for meta-analyses evaluating diagnostic tests. Ann Intern Med 1994; 120:667-676.[Abstract/Free Full Text]
  23. Fleischmann KE, Hunink MG, Kuntz KM, Douglas PS. Exercise echocardiography or exercise SPECT imaging? a meta-analysis of diagnostic test performance. JAMA 1998; 280:913-920.[Abstract/Free Full Text]
  24. Owens DK, Holodniy M, McDonald TW, Scott J, Sonnad S. A meta-analytic evaluation of the polymerase chain reaction for the diagnosis of HIV infection in infants. JAMA 1996; 275:1342-1348.[Abstract/Free Full Text]
  25. Harvey RT, Gefter WB, Hrung JM, Langlotz CP. Accuracy of CT angiography versus pulmonary angiography in the diagnosis of acute pulmonary embolism: evaluation of the literature with summary ROC curve analysis. Acad Radiol 2000; 7:786-797.[CrossRef][Medline]
  26. Rathbun SW, Raskob GE, Whitsett TL. Sensitivity and specificity of helical computed tomography in the diagnosis of pulmonary embolism: a systematic review. Ann Intern Med 2000; 132:227-232.[Abstract/Free Full Text]
  27. Trujillo NP, Pratt JP, Talusani S, Quaife RA, Kumpe D, Lear JL. DTPA aerosol in ventilation/perfusion scintigraphy for diagnosing pulmonary embolism. J Nucl Med 1997; 38:1781-1783.[Abstract/Free Full Text]
  28. Midgette AS, Stukel TA, Littenberg B. A meta-analytic method for summarizing diagnostic test performances: receiver-operating-characteristic-summary point estimates. Med Decis Making 1993; 13:253-257.
  29. Woods ER, Iles S, Jackson S. Comparison of scintigraphic diagnostic criteria in suspected pulmonary embolism. Can Assoc Radiol J 1989; 40:194-197.[Medline]
  30. Fletcher RH, Fletcher SW, Wagner EH. Diagnosis. In: Satterfield TS, Taylor C, eds. Clinical epidemiology. 3rd ed. Philadelphia, Pa: Lippincott Williams & Wilkins, 1996; 43-66.
  31. Rothman KJ. Random error and role of statistics In: Epidemiology. An introduction. New York, NY: Oxford University Press, 2002; 113-129.
  32. van Erkel AR, van Rossum AB, Bloem JL, Kievit J, Pattynama PM. Spiral CT angiography for suspected pulmonary embolism: a cost-effectiveness analysis. Radiology 1996; 201:29-36.[Abstract/Free Full Text]
  33. van Rossum AB, Pattynama PM, Mallens WM, Hermans J, Heijerman HG. Can helical CT replace scintigraphy in the diagnostic process in suspected pulmonary embolism? a retrolective-prolective cohort study focusing on total diagnostic yield. Eur Radiol 1998; 8:90-96.[CrossRef][Medline]
  34. Paterson DI, Schwartzman K. Strategies incorporating spiral CT for the diagnosis of acute pulmonary embolism: a cost-effectiveness analysis. Chest 2001; 119:1791-1800.[Abstract/Free Full Text]
  35. Perrier A, Howarth N, Didier D, et al. Performance of helical computed tomography in unselected outpatients with suspected pulmonary embolism. Ann Intern Med 2001; 135:88-97.[Abstract/Free Full Text]
  36. Rozanski A, Diamond GA, Berman D, Forrester JS, Morris D, Swan HJ. The declining specificity of exercise radionuclide ventriculography. N Engl J Med 1983; 309:518-522.[Abstract]
  37. Mullins MD, Becker DM, Hagspiel KD, Philbrick JT. The role of spiral volumetric computed tomography in the diagnosis of pulmonary embolism. Arch Intern Med 2000; 160:293-298.[Abstract/Free Full Text]
  38. Kearon C. Excluding pulmonary embolism with helical (spiral) computed tomography: evidence is catching up with enthusiasm (commentary). CMAJ 2003; 168:1430-1431.[Free Full Text]
  39. Roger VL, Pellikka PA, Bell MR, Chow CW, Bailey KR, Seward JB. Sex and test verification bias: impact on the diagnostic value of exercise echocardiography. Circulation 1997; 95:405-410.[Abstract/Free Full Text]
  40. Ghaye B, Szapiro D, Mastora I, et al. Peripheral pulmonary arteries: how far in the lung does multi-detector row spiral CT allow analysis? Radiology 2001; 219:629-636.[Abstract/Free Full Text]
  41. Irwig L, Macaskill P, Glasziou P, Fahey M. Meta-analytic methods for diagnostic test accuracy. J Clin Epidemiol 1995; 48:119-130.[CrossRef][Medline]
  42. Oei EH, Nikken JJ, Verstijnen AC, Ginai AZ, Myriam Hunink MG. MR imaging of the menisci and cruciate ligaments: a systematic review. Radiology 2003; 226:837-848.[Abstract/Free Full Text]
  43. Visser K, Hunink MG. Peripheral arterial disease: gadolinium-enhanced MR angiography versus color-guided duplex US—a meta-analysis. Radiology 2000; 216:67-77.[Abstract/Free Full Text]
  44. Hunink M, Glasziou P, Siegel J, et al. Finding and summarizing the evidence: decision making in health and medicine—integrating evidence and values Cambridge, England: Cambridge University Press, 2001; 214-244.



This article has been cited by other articles:


Home page
RadiologyHome page
M.-P. Revel, R. Triki, G. Chatellier, S. Couchon, N. Haddad, A. Hernigou, C. Danel, and G. Frija
Is It Possible to Recognize Pulmonary Infarction on Multisection CT Images?
Radiology, September 1, 2007; 244(3): 875 - 882.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
A. Scarsbrook, K. Bradley, F. Gleeson, A. M. Groves, S. J. Yates, T. Win, I. Kayani, J. Bomanji, and P. J. Ell
Perfusion Scintigraphy Still has Important Role in Evaluation of Majority of Pregnant Patients with Suspicion of Pulmonary Embolism
Radiology, August 1, 2007; 244(2): 623 - 625.
[Full Text] [PDF]


Home page
RadioGraphicsHome page
C. H. McCollough, B. A. Schueler, T. D. Atwell, N. N. Braun, D. M. Regner, D. L. Brown, and A. J. LeRoy
Radiation Exposure and Pregnancy: When Should We Be Concerned?
RadioGraphics, July 1, 2007; 27(4): 909 - 917.
[Abstract] [Full Text] [PDF]


Home page
JNMHome page
N. Tunariu, S. J.R. Gibbs, Z. Win, W. Gin-Sing, A. Graham, P. Gishen, and A. AL-Nahhas
Ventilation-Perfusion Scintigraphy Is More Sensitive than Multidetector CTPA in Detecting Chronic Thromboembolic Pulmonary Disease as a Treatable Cause of Pulmonary Hypertension
J. Nucl. Med., May 1, 2007; 48(5): 680 - 684.
[Abstract] [Full Text] [PDF]


Home page
Ann Fam MedHome page
J. B. Segal, J. Eng, L. J. Tamariz, and E. B. Bass
Review of the Evidence on Diagnosis of Deep Venous Thrombosis and Pulmonary Embolism
Ann. Fam. Med, January 1, 2007; 5(1): 63 - 73.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
U. R. O. Nyman, B. Jacobsson, M. D. Cham, D. F. Yankelevitz, and C. I. Henschke
Routine CT Venography after CT for Pulmonary Embolism: Evidence-based Radiology or Hemorrhage from Anticoagulation of False-Positive Deep Venous Thrombosis?
Radiology, December 1, 2006; 241(3): 945 - 947.
[Full Text] [PDF]


Home page
RadiologyHome page
A. M. Groves, S. J. Yates, T. Win, I. Kayani, F. A. Gallagher, R. Syed, J. Bomanji, and P. J. Ell
CT Pulmonary Angiography versus Ventilation-Perfusion Scintigraphy in Pregnancy: Implications from a UK Survey of Doctors' Knowledge of Radiation Exposure
Radiology, September 1, 2006; 240(3): 765 - 770.
[Abstract] [Full Text] [PDF]


Home page
Br. J. Radiol.Home page
S Matthews
Imaging pulmonary embolism in pregnancy: what is the most appropriate imaging protocol?
Br. J. Radiol., May 1, 2006; 79(941): 441 - 444.
[Abstract] [Full Text] [PDF]


Home page
Evid. Based Med.Home page
ADDITIONAL ARTICLES ABSTRACTED IN ACP JOURNAL CLUB
Evid. Based Med., October 1, 2005; 10(5): 156 - 156.
[Full Text] [PDF]


Home page
ChestHome page
E. S. Darze, A. L. Latado, A. G. Guimaraes, R. A. V. Guedes, A. B. Santos, S. S. de Moura, and L. C. S. Passos
Incidence and Clinical Predictors of Pulmonary Embolism in Severe Heart Failure Patients Admitted to a Coronary Care Unit
Chest, October 1, 2005; 128(4): 2576 - 2580.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hayashino, Y.
Right arrow Articles by Fukui, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hayashino, Y.
Right arrow Articles by Fukui, T.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
RADIOLOGY RADIOGRAPHICS RSNA JOURNALS ONLINE