|
|
||||||||
Evidence-based Practice |
1 From the Departments of Surgery (M.W., J.J.B.v.L.), Clinical Epidemiology and Biostatistics (J.B.R.), Radiology (J.S.), Gastroenterology (P.F.), and Nuclear Medicine (B.L.F.V.E., G.W.S.), Academic Medical Center, University of Amsterdam, Suite G4-130, Meibergdreef 9, 1105 AZ Amsterdam, the Netherlands; Department of Surgery (H.L.v.W., J.T.M.P.) and Nuclear Medicine/PET Center (P.L.J.), University Hospital Groningen, Groningen, the Netherlands; and Department of Nuclear Medicine and PET Research, VU Medical Center, Amsterdam, the Netherlands (O.S.H.). Received June 11, 2004; revision requested August 24; revision received September 28; accepted October 22. Address correspondence to M.W. (e-mail: m.westerterp{at}AMC.UVA.NL).
| ABSTRACT |
|---|
|
|
|---|
MATERIALS AND METHODS: MEDLINE and EMBASE databases and Cochrane Database of Systematic Reviews were searched for relevant studies. Two reviewers independently assessed the methodological quality of each study. Summary receiver operating characteristic (ROC) analysis was used to summarize and compare the diagnostic accuracy of the three modalities.
RESULTS: Four studies with CT, 13 with endoscopic US, and seven with FDG PET met inclusion criteria. Percentages of the maximum score in regard to methodological quality ranged from 15% to 100%. Summary ROC analysis could be performed for three studies with CT, four with endoscopic US, and four with FDG PET. The maximum joint values for sensitivity and specificity were 54% for CT, 86% for endoscopic US, and 85% for FDG PET. Accuracy of CT was significantly lower than that of FDG PET (P < .006) and of endoscopic US (P < .003). Accuracy of FDG PET and that of endoscopic US were similar (P = .839). In all patients, CT was always feasible, whereas endoscopic US was not feasible in 6% of the patients, and FDG PET was not feasible in less than 1%.
CONCLUSION: CT has poor accuracy for assessment of response to neoadjuvant therapy in patients with esophageal cancer. Endoscopic US and FDG PET have equivalent good accuracy, but endoscopic US is not always feasible after chemotherapy and radiation therapy. FDG PET seems to be a promising noninvasive tool for assessment of neoadjuvant therapy in patients with esophageal cancer.
© RSNA, 2005
| INTRODUCTION |
|---|
|
|
|---|
In a large proportion of patients, however, an insufficient objective response is achieved. These patients do not benefit from neoadjuvant therapy but suffer from toxic side effects while appropriate surgical therapy is delayed. For this reason, a diagnostic test that can be used to accurately predict tumor response early in the course of neoadjuvant therapy is of crucial importance. Currently, there is no universally accepted, reproducible, and reliable means of monitoring the response of esophageal cancer to chemotherapy.
Response to therapy currently is evaluated by using morphologic imaging, such as computed tomography (CT) and endoscopic ultrasonography (US) (810). General restrictions of these methods are the difficulty in distinguishing viable tumor from necrotic or fibrotic tissue and the delay between cell kill and tumor shrinkage (5,11,12).
With fluorine 18 fluorodeoxyglucose (FDG) positron emission tomography (PET), alterations in tissue metabolism that generally precede anatomic change are reflected. FDG preferentially accumulates in cells with high rates of glucose utilization (eg, the brain, the myocardium, and most solid malignancies). The accumulation of FDG in tumor cells can be measured noninvasively by using PET. There are various approaches for analytic methods that range from visual assessment (qualitative) to semiquantitative indices (eg, standardized uptake value [SUV]). FDG PET has been shown to be sensitive in the imaging of several malignancies (eg, lymphoma, lung cancer, colorectal cancer, and head and neck cancer) and has been shown to be promising for detection of the response to nonsurgical therapy in breast cancer (13). Thus, the purpose of our study was to compare the diagnostic accuracy of CT, endoscopic US, and FDG PET for the assessment of the response to neoadjuvant therapy in patients with esophageal cancer by using a systematic review of the literature.
| MATERIALS AND METHODS |
|---|
|
|
|---|
A manual search with cross-references of the eligible articles was performed to identify additional relevant articles. We did not include conference abstracts because of the limited data presented in them.
The same two authors independently assessed articles for possible inclusion in the review by checking titles and abstracts. Included were clinical studies that fulfilled all the following inclusion criteria: evaluation of CT, endoscopic US, or FDG PET in the assessment of the response to neoadjuvant therapy (before and after therapy); histologic proof of cancer of the esophagus; presence of gastroesophageal junction or gastric cardia; use of a valid reference standard (ie, pathologic findings); and a number of patients of 10 or greater. Duplicate studies involving the same patients were excluded. The final decision about inclusion was based on the full article. Disagreement was resolved in a consensus meeting.
Quality Assessment
The same two reviewers who performed the search and assessed publications for inclusion independently assessed the methodological quality of the included studies. They used the list of criteria recommended by the Cochrane Methods Working Group on Systematic Review of Screening and Diagnostic Tests, with some modifications (14). The complete list of criteria is shown in Table 1.
|
Data and Statistical Analysis
For all included studies, an attempt was made to extract the 2 x 2 table that was created with cross-classification of the patients on the basis of the results of CT, endoscopic US, and FDG PET and the final outcome that was confirmed with the reference standard (histologic findings). From these tables, we calculated sensitivity and specificity, together with exact 95% confidence intervals, on the basis of the binomial distribution for each study. Forrest plots were used to present data. For each study, we plotted the pairs of sensitivity and specificity in receiver operating characteristic (ROC) space and used the approach of Moses et al (15), Littenberg and Moses (16), and Irwig et al (17) to summarize the data by fitting the summary ROC curve. This method models test accuracy, defined by the log diagnostic odds ratio (D), as a linear function of the test threshold (S). S represents the (implicit) threshold for a positive test result for each study. The model D = a + bS was used to capture the variation in the diagnostic odds ratio caused by differences in test threshold between studies. This model was fitted by using equally weighted least squares regression because this model produces results that are more consistent with a random-effects approach and allows additional variance in accuracy beyond the test threshold and sampling error (18). We transformed the regression line back to the original ROC space to obtain the summary ROC curve. Because the log odds ratio is difficult to interpret clinically, we expressed the results in terms of the maximum joint sensitivity and specificity, also known as the Q-point. This is the point on the summary ROC curve closest to the optimal upper left corner of the ROC plot. It is also the point where sensitivity equals specificity.
We used the Q-point to test whether there was a difference in accuracy between techniques. This test was directly derived from the model by comparing the diagnostic odds ratio at S = 0 among the three modalities. For clinical purposes, however, a high negative predictive value of a modality is necessary to avoid erroneous discontinuation of neoadjuvant therapy. Therefore, the predicted value of specificity at a high value of sensitivity of 90% was calculated from the summary ROC regression line. A difference with a two-sided P value less than .05 was considered significant. To avoid division by zero in the calculation of the diagnostic odds ratio, the standard correction of adding 0.5 to all four cells of the 2 x 2 table was applied when one of the four cells contained a zero. Statistical analyses were performed by using a statistical software system (SAS, version 8.02; SAS, Cary, NC).
| RESULTS |
|---|
|
|
|---|
Five studies were potentially relevant to the value of CT in the monitoring of a response. Of these remaining five studies, the study by Helmberger et al (19) was excluded because of the absence of a valid reference standard. In that study, CT during neoadjuvant therapy was compared with endoscopy, although the value of endoscopy in the monitoring of the response to therapy is still debatable. Thus, only four studies with CT met the inclusion criteria (2023).
In the case of endoscopic US, 14 studies were potentially relevant to the value of a therapeutic response assessment. The study by Nousbaum et al (24) was excluded because of the absence of a valid reference standard; the authors used recurrence instead of histologic findings as the reference standard. Thus, 13 studies with endoscopic US met the inclusion criteria (23,2536).
For studies with FDG PET, 11 were potentially relevant to the value of therapeutic response assessment. Of these 11 studies, four were excluded after a review of the full article. One study was excluded because of the absence of a baseline FDG PET image (37). Couper et al (38) compared FDG PET during neoadjuvant therapy with CT and changes in weight and dysphagia scores. Since the value of CT in monitoring a response is still under discussion, it is not suitable to be used as a reference standard. Another two studies were excluded because one comprised carcinomas other than esophageal cancer and one included fewer than 10 patients with esophageal cancer (39,40). Thus, seven studies met the inclusion criteria (23,4146). The characteristics of the included studies are shown in Table 2.
|
|
Quality of studies with FDG PET was high for definition of the reference test (internal validity criterion 1: six of seven = 86% vs three of four = 75% for studies with CT and only four of 13 = 31% for studies with endoscopic US) and definition of the index test (internal validity criterion 2: seven of seven = 100% vs only six of 13 = 46% for studies with endoscopic US). The definition of the index test was also present in all studies with CT.
Studies with CT
Walker et al (20) evaluated 38 patients with esophageal cancer by using CT and a barium swallow study before and after completion of preoperative chemotherapy. Chemotherapy consisted of a combination of fluorouracil, doxorubicin hydrochloride (Adriamycin), and mitomycin (n = 15 patients), also known as FAM; a combination of mitomycin, ifosfamide, and cisplatin (n = 19 patients), also known as MIC; or a combination of cyclophosphamide, doxorubicin hydrochloride, vincristine sulfate, methotrexate, and etoposide (n = 4), also known as CAPOMET. Radiographic response was assessed according to the criteria of Miller et al (47). Pathologic response was defined as follows: complete microscopic response (no microscopic evidence of residual tumor); complete macroscopic response (no visible evidence of residual tumor); partial response (unequivocal signs of healing); no response (no substantial signs of healing). A response to therapy, either partial or complete, was found in 95% (n = 36) of the patients, and 47% (n = 18) of all patients showed a response at CT.
Griffith et al (21) assessed the therapeutic response by using spiral CT after completion of two courses of fluorouracil and cisplatin in 45 patients with squamous cell carcinoma. Response to therapy was defined as a tumor volume reduction of more than 50%. The pathologic response was assessed according to the criteria of Mandard et al (48). According to these criteria, 24% (n = 11) of patients were responders. Griffith et al did not find a correlation between tumor volume reduction at serial CT and quantitative pathologic tumor assessment, nor did they find a correlation between tumor volume reduction and survival.
Jones et al (22) assessed the value of CT for tumor response assessment in 50 patients with esophageal cancer who were treated with fluorouracil and cisplatin, whereas in 12 patients, paclitaxel was added to the regimen. Radiographic response was determined by using the response criteria for solid tumors of the Eastern Cooperative Oncology Group. Pathologic response was defined as follows: no tumor (ie, responders, 42% [n = 21]) and tumor present (ie, nonresponders, 58% [n = 29]). CT had a sensitivity of 33% and a specificity of 66% in accurate evaluation of the pathologic response of the tumor to chemoradiation therapy.
Kroep et al (23) evaluated 13 patients with esophageal cancer who were treated with neoadjuvant cisplatin and gemcitabine plus granulocyte macrophage colony-stimulating growth factor. CT assessment of the therapeutic response was determined, according to World Health Organization criteria (47). A pathologic response was observed in 36% of the patients, as assessed according to criteria of Mandard et al (48). Both early (after two cycles of neoadjuvant therapy) and late (after completion of neoadjuvant therapy) response evaluation showed a specificity of 50% and a sensitivity of 71%.
Studies with Endoscopic US
In nine studies with endoscopic US that were included, restaging, which was indicated by a change in tumor stage, was used as a parameter for assessment of neoadjuvant therapeutic response (23,2532). In three endoscopic US studies, change in volume measurements of the maximum tumor cross-sectional dimensions was used as a parameter for assessment (3336). Researchers in one study evaluated both restaging and volume measurements (36). These parameters are different from those we used, and, thus, evaluation of the results with them was interpreted separately.
Restaging.In seven of nine studies, a test definition was not described in regard to restaging with endoscopic US after neoadjuvant therapy (2527,2932,36). Before and after therapy, staging with the TNM classification was described properly, but the definition of responders or nonresponders was not described. Therefore, the value of endoscopic US for the monitoring of a response in these studies could not be determined. Accuracy in regard to tumor staging after completion of therapy, however, could be determined and ranged from 27%82%, with a median of only 48%.
Giovannini et al (28) described a test definition. Thirty-two patients were included in that study. Complete endoscopic US was not performed in six of the 32 patients before therapy because of esophageal stenosis. After therapy, complete endoscopic US was performed in all patients. Thus, findings at endoscopic US in 26 patients with T3 and T4 esophageal carcinomas who were treated with chemoradiotherapy were correlated to the histologic findings in the resected specimens. Chemoradiotherapy consisted of two courses of fluorouracil and cisplatin and a total of 30 Gy of radiation therapy. Endoscopic US criteria for prediction of a response were as follows: T0, complete response; Tw, microscopic evidence of tumor residue (major response); and T2T4, no response, or minor response (depending on the primary stage prior to chemoradiotherapy). If the histologic findings in the resected specimen indicated no tumor (complete responder) or pT1 tumor (partial responder), patients were defined as responders.
Kroep et al (23) evaluated 13 patients with esophageal cancer who were treated with neoadjuvant cisplatin and gemcitabine plus granulocyte macrophage colony-stimulating growth factor. At endoscopic US, the therapeutic response assessment was defined as complete if there was no visible tumor left, as partial after downstaging with the TNM criteria, as progressive in case of an increase in the TNM stage, or as stable when no change in TNM stage was determined. Pathologic response was assessed according to criteria of Mandard et al (48). The early (after two cycles) and late (after completed neoadjuvant therapy) responses showed a specificity of 100% and a sensitivity of 100% in 12 patients who could be evaluated. Evaluation was not feasible in one patient because of the development of liver metastases, which precluded surgical resection. In one patient, endoscopic US after treatment was not feasible because of the impossibility of esophageal passage caused by the tumor. This patient, however, was classified as a nonresponder and was not excluded from the analysis.
Volume measurement.Isenberg et al (36) conducted a study in 31 patients to determine whether the estimation of tumor size at endoscopic US could be used to assess the response to preoperative chemotherapy and radiation therapy (cisplatin and/or carboplatin with fluorouracil and concurrent mediastinal radiation of at least 30 Gy). An endoscopic US response was defined as a 50% reduction in maximal cross-sectional area of the tumor. A shortcoming of this study is the absence of a definition of pathologic responders and nonresponders, the lack of which makes it impossible to reproduce the data.
Chak et al (34) measured maximal cross-sectional area in 50 patients before and after neoadjuvant therapy (cisplatin or carboplatin and fluorouracil concurrently administered with a total of 50 Gy radiation therapy). Patients with more than a 50% reduction were classified as endoscopic US responders. The authors compared survival and not pathologic findings, with patients classified as responders and the patients classified as nonresponders with endoscopic US. Survival time in the group of responders was significantly longer than it was in the group of nonresponders (median survival of 17.6 months vs 14.5 months, P < .05).
Hirata et al (33) evaluated pre- and posttherapeutic endoscopic US in 34 patients who underwent various preoperative treatments consisting of radiation therapy (to a total of 30 Gy), chemotherapy (bleomycin or cisplatin), and hyperthermia. In 17 patients, the tumor could not be passed before treatment; in these patients, the maximum cross- sectional area was measured only at the level of the most cranial portion of the tumor. Response at endoscopic US was defined as a 15%30% reduction and more than a 30% reduction in maximal cross-sectional area. Pathologic data were available for 27 patients who underwent surgery. Histopathologic response was graded according to the number of viable cells in the entire lesion. Therapy was considered ineffective when viable cells occupied more than two-thirds of the entire tumor. Hirata et al found a correlation between percentage of reduction in maximal cross-sectional area and histologic evidence of effectiveness of neoadjuvant therapy. There was a significant difference (P = .05) in overall survival rates among the three groups, according to a reduction in the area of the tumor: less than 15%, 15%30%, and more than 30%.
Willis et al (35) correlated responses as measured with endoscopic US in 41 patients with a defined pathologic tumor regression grade as proposed by Mandard et al (48). Patients received cisplatin or carboplatin and fluorouracil concurrently with a total of 50 Gy of radiation therapy. A positive response at endoscopic US was defined as more than a 50% reduction in maximal cross-sectional area. Willis et al found a strong correlation between the endoscopic US therapeutic response and pathologic tumor regression induced with chemotherapy and radiation therapy.
Studies with FDG PET
Arslan et al (41) evaluated the value of the use of FDG PET in 24 patients who underwent chemoradiation therapy consisting of administration of cisplatin and fluorouracil, cisplatin and taxotore, or carboplatin and fluorouracil and concurrent radiation therapy with 50.4 Gy. The response of the primary tumor was visually assessed and analyzed semiquantitatively with the SUV before and after treatment. Patients were classified into two groups according to histopathologic findings at surgery: responding patients without gross disease and nonresponding patients with gross disease. Differences in quantitative FDG PET data between responders and nonresponders were calculated as a mean value for both groups and not on a per-patient basis. Therefore, accuracy could not be assessed.
Kato et al (42) described 10 patients who received nedaplatin and fluorouracil in combination with radiation therapy of 40 Gy. Histologic response, according to the criteria of the Japanese Society for Esophageal Diseases (49), was correlated with the SUV after treatment and the FDG uptake length decrease; however, it was not correlated with the rate of SUV decrease.
Downey et al (43) assessed the prognostic value of FDG PET in 17 patients with esophageal carcinoma who were treated with chemotherapy (paclitaxel and cisplatin), radiation therapy (50.4 Gy), and surgery. At a threshold of a 60% decrease in FDG uptake, they found a significant difference (P = .055) in 2-year disease-free survival between patients who had a smaller decrease in SUV (38% survival) and patients who had a greater decrease in SUV (67% survival).
In a study of Brucher et al (44), 27 patients were included who underwent external-beam radiation therapy and treatment with fluorouracil as a continuous infusion. Pathologic response was assessed according to the criteria of Mandard et al (48). At a threshold level of a 52% decrease in FDG uptake compared with the baseline value, sensitivity for prediction of a response was 100%, with a corresponding specificity of 55%. The positive and negative predictive values were 72% and 100%, respectively. Median survival was 22.5 months ± 2.4 (standard deviation) in responders evaluated with FDG PET versus only 6.7 months ± 5 in nonresponders evaluated with FDG PET (P < .001).
Weber et al (45) reported a series of 40 patients who received chemotherapy of 72 days duration. After 14 days of therapy, mean reduction of tumor FDG uptake, according to criteria of Mandard et al (48), was significantly different between responders (54% ± 17) and nonresponders (15% ± 21). An optimal differentiation was achieved by using a cutoff value of 35%. The 2-year survival rate was 60% in responders evaluated with FDG PET versus 37% in nonresponders evaluated with FDG PET (P < .04).
Flamen et al (46) included 36 patients in their study with T4 esophageal cancer, who received concomitant external-beam radiation therapy plus cisplatin and fluorouracil. Patients were classified as major responders evaluated with FDG PET when a strong reduction in FDG uptake was demonstrated at posttherapy PET. A patient was classified as a major responder to therapy when the histologic findings of the primary tumor in the resection specimen indicated a classification of pT0pT2 or pT3 (if the residual tumor consisted of small foci of viable tumor on a background of an extensive histopathologic response) and there was no sign of any tumoral viability beyond the primary tumor site. After patients completed chemoradiotherapy, the sensitivity of serial FDG PET for a major response was 71% (10 of 14 patients), and specificity was 82% (18 of 22 patients). Major responders evaluated with FDG PET had a median survival of 16.3 months; and nonmajor responders, 6.4 months (P = .005)
Kroep et al (23) evaluated 13 patients with esophageal cancer who were treated with neoadjuvant cisplatin and gemcitabine plus granulocyte macrophage colony-stimulating growth factor. The pathologic response was assessed according to criteria of Mandard et al (48). Evaluation of early (after two cycles) and late (after completed induction therapy) responses yielded a specificity of 86% and 100%, respectively, and a sensitivity of 100% at both intervals, with cutoff values of 40% for early responses and 60% for late responses.
Comparison of Diagnostic Accuracy
Three studies with CT (2123), four with endoscopic US (23,28,33,35), and four with FDG PET (23,4446) provided enough data to permit calculation of sensitivity and specificity (Fig 1). The sensitivity of CT, endoscopic US, and FDG PET ranged from 33% to 55%, 50% to 100%, and 71% to 100%, respectively. The specificity ranged from 50% to 71%, 36% to 100%, and 55% to 100%, respectively. Summary ROC curves for the three modalities are given in Figure 2. The maximum joint sensitivity and specificity (Q-point) values for CT, endoscopic US, and FDG PET were 54% (95% confidence interval: 31%, 77%), 86% (95% confidence interval: 80%, 93%), and 85% (95% confidence interval: 77%, 93%), respectively. Overall accuracy of CT was significantly lower than was the accuracy of endoscopic US (P < .003) and of FDG PET (P < .006). Overall accuracy of endoscopic US was similar to that of FDG PET (P = .839). The estimated specificity values of CT, endoscopic US, and FDG PET (values that corresponded to a desired level of sensitivity of 90%) were 13%, 78%, and 78%, respectively. Evaluation of neoadjuvant therapy in the analyzed studies was not feasible by using FDG PET in one (<1%) of 116 patients, and it was not feasible by using endoscopic US in seven (6%) of 120 patients. Endoscopic US was suboptimal in 17 (14%) of 120 patients. In contrast, CT was always feasible.
|
|
|
| DISCUSSION |
|---|
|
|
|---|
CT generally is considered the state-of-the-art diagnostic modality in the monitoring of nonsurgical therapy for solid tumors. For therapeutic response assessment in esophageal cancer, however, accuracy was low with CT. This is probably related to the difficulty in the differentiation between viable tumor and reactive changes, including edema and scar tissue, at CT. Another reason may be the knowledge that CT is of limited value for initial tumor staging, because of poor differentiation between tumors with stages from T1 to T3. It is far less accurate than is endoscopic US, which is presently considered the standard for staging of the primary tumor. Although only a limited number of studies with far from optimal methods were available, we conclude that singledetector row CT is a poor diagnostic tool for the determination of the pathologic tumor response after chemoradiotherapy in patients with esophageal cancer.
The articles with CT included in this review concerned studies in which thick section thickness (810 mm) was used. With the introduction of multi-section scanners, the use of thinner section thickness has become possible. Thinner section thickness can be expected to lead to improved delineation of the tumor and, hence, improved three-dimensional measurements and more accurate and reproducible measurements of tumor volume. This may improve the results of CT in the near future for monitoring of the response with respect to tumor volume.
Endoscopic US has been developed over the past decade and has been shown to be highly accurate for the initial staging of the primary tumor. In contrast to staging of untreated esophageal cancer, the value of restaging with endoscopic US after neoadjuvant therapy appears to be limited by the difficulty in the differentiation of residual carcinoma from inflammation and fibrosis (26,31). This limitation is supported by the finding that restaging with endoscopic US after therapy had a poor accuracy, with pathologic staging as the reference standard; accuracy ranged from 27% to 82%. Nevertheless, the accuracy of endoscopic US for therapeutic response assessment in our review was significantly higher than the accuracy of CT and was comparable to the accuracy of FDG PET. These results should be interpreted with caution, because they were based on four studies with poor to moderate methodological quality.
Factors that may limit endoscopic US for assessment of a nonsurgical response are postradiation esophagitis, luminal stenosis, and compression of the tumor caused by the endoscope (26,50). In the four studies with endoscopic US that were included for ROC analysis in this review, endoscopic US was not feasible in 6% (23,28) and was suboptimal in 14% of patients (33). In addition, to be useful for therapeutic response assessment, performance of repeated measurements with high reproducibility is of crucial importance, and this can only be achieved if well-trained and experienced operators are available.
Overall accuracy of FDG PET was similar to that of endoscopic US. The maximum joint sensitivity and specificity was 85% and 86%, respectively, for these two modalities. Metabolic imaging with FDG PET enables discrimination of viable tumor from necrotic scar tissue in patients with esophageal cancer after neoadjuvant therapy. Nonspecific glucose uptake by tissue with inflammation caused by chemotherapy and/or radiation therapy may be a limitation for FDG PET in therapeutic response assessment. Although the suboptimal sensitivity caused by false-negative results for response assessment in some studies (44,46) could be explained by this mechanism, this finding is rare and was not observed in other studies (23,45). Most likely, the contribution of FDG uptake caused by inflammation to total FDG uptake is low (51), and, moreover, the threshold for FDG decrease to define responders can be adjusted to select responders almost exclusively (ie, sensitivity of 100% with 100% negative predictive values only as performed by Weber et al [45] and Kroep et al [23]).
A wide range of cutoff values, from 30% to 80% and expressed as SUV, for reduction in FDG uptake has been reported for use in the discrimination between responders and nonresponders. This range of cutoff values suggests a lack of standardization, but it also might be explained by factors such as spectrum of disease, differences in therapy (eg, effectiveness in eradication of tumor), and timing of evaluation of the therapy. Kroep et al (23) used an optimal cutoff value of 40% after two cycles and of 60% after completion (four to six cycles) of chemotherapy. Still, an advantage of FDG PET is the high reproducibility (5254), with a change of greater than 20% considered to reflect a true biologic effect (51).
Furthermore, various analytic models (SUV with corrections for body surface area, simplified kinetic model, nonlinear regression analysis, etc) are available and have been tested to optimize therapeutic response monitoring by using FDG PET. These models have been described in detail by Hoekstra et al (55) and have been described particularly for esophageal cancer by Kroep et al (23). Because assessment of SUV combines accuracy with simplicity, this model is used in most studies. Further research and validation, with a focus on the cutoff values by the different analytic models for optimal discrimination between responders and nonresponders, has to be done.
For this review, we used histopathologic findings as a valid reference standard. The ultimate aim, however, was to identify patients with a poor clinical outcome early in the course of neoadjuvant therapy; in only seven studies was survival reported, and these studies included one with CT (21), two with endoscopic US (33,34), and four with FDG PET (4346). In the study with CT, outcome was not predicted, whereas in both studies with endoscopic US, a slightly but significantly higher survival was found in responders versus nonresponders. In all four studies with FDG PET, striking differences with significantly longer survival in metabolically responding patients were found.
Response to neoadjuvant treatment was evaluated after completion of this nonsurgical treatment in most studies. These findings may hold prognostic value as mentioned previously, but they will not alter the duration or intensity of neoadjuvant treatment. Only if evaluation is performed early in the course of neoadjuvant treatment and the test accurately discriminates responders from nonresponders can the test results be used to aid in the decision about whether this toxic therapy should be continued, as in responders, or stopped, as in nonresponders. Researchers in only two studies (23,45) investigated the role of FDG PET in distinguishing responders from nonresponders early during the course of chemoradiotherapy. Kroep et al (23) and Weber et al (45) showed encouraging results, with high negative predictive values for the early response to neoadjuvant treatment in patients with esophageal cancer (95% and 100%, respectively).
Early prediction of tumor response might be improved by taking other factors into account. For esophageal cancer, several histologic markers, such as the tumor suppressor gene p53 (56), the proliferative marker Ki-67, and the epidermal growth factor receptor, have been evaluated for the prediction of the therapeutic response even prior to neoadjuvant therapy. Neither a single marker nor a combination of markers, however, can currently be used to predict the response with sufficient accuracy. In the future, gene expression profiling may help to identify markers that can be used in combination with imaging results for prediction of the response to neoadjuvant therapy.
In addition, there are some general limitations to this review, which should be kept in mind when one interprets the summary ROC curves. The number of studies and patients included in our review was small. Only 11 studies for the three modalities combined were eligible for ROC analysis, which provided a total of 318 patients.
In none of the studies was a head-to-head comparison used to test directly for a difference in accuracy between modalities within the same group of patients. Although Kroep et al (23) evaluated all three modalities in the same patient population, they did not directly compare accuracy, possibly because of the small number of patients who could be evaluated and who underwent examinations with all three modalities.
Some potential sources of heterogeneity need to be addressed. First, neoadjuvant therapeutic schemes differed substantially among the studies, with a major difference in the use of chemotherapy alone in four studies versus the combination of chemotherapy and radiation therapy in seven studies. Second, differences in the spectrum of disease can be an important source of heterogeneity. We restricted our review to studies that included patients who were eligible for curative surgery and, therefore, were candidates for neoadjuvant treatment. Nevertheless, tumor stage may vary from small T1 tumors to large T3 tumors. Measurements of change in tumor volume or of FDG uptake may be less accurate in small tumors than they are in large tumors because of the larger influence of systematic errors and lower accuracy. Moreover, small tumors may be missed at CT or FDG PET at initial staging. Subanalyses in future studies with larger patient populations are necessary to document the influence of the initial tumor stage on therapeutic response assessment. Third, it is important to know whether researchers defined their cutoff value prior to the study or after they obtained the results. In the latter case, there is an increased likelihood that the authors selected the cutoff value to maximize a particular test characteristic, which reduces the likelihood that investigators in another study will be able to replicate the findings. Different methods or thresholds to define a positive test result can lead to a special source of variation. Response criteria were defined differently within each modality (Table 2). Two different parameters were used for endoscopic US (ie, restaging and maximal cross-sectional area). In all studies with CT, volume measurements were used, but the thresholds differed slightly. In all studies with FDG PET, except that of Flamen et al (46), the SUV was used. Flamen et al used the delta tumor-to-liver uptake ratio, which is comparable with the SUV, albeit with correction for liver activity. These differences, however, are small, and the summary ROC approach was developed to incorporate differences in accuracy that arise because of a change in threshold between studies for defining a test as positive.
In conclusion, single-section CT for assessment of the response to neoadjuvant therapy in esophageal cancer is inaccurate and, therefore, is not recommended. Endoscopic US and FDG PET have equivalent accuracy. Endoscopic US can be used to identify patients in whom a pathologic response was achieved, but it is not always feasible during or shortly after chemoradiation and, therefore, is not routinely used for therapeutic response assessment. FDG PET, which is used to measure alterations in tissue metabolism, seems to be a promising noninvasive tool for assessment of the response to neoadjuvant therapy in patients with esophageal cancer. Studies in larger patient populations with sufficient power, which focus on the prediction of tumor response early in the course of neoadjuvant therapy and which include direct comparisons of different modalities, are needed.
| FOOTNOTES |
|---|
Abbreviations: FDG = fluorine 18 fluorodeoxyglucose ROC = receiver operating characteristic SUV = standardized uptake value
Authors stated no financial relationship to disclose.
Author contributions: Guarantors of integrity of entire study, J.B.R., J.T.M.P., G.W.S.; study concepts/study design or data acquisition or data analysis/interpretation, all authors; manuscript drafting or manuscript revision for important intellectual content, all authors; approval of final version of submitted manuscript, all authors; literature research, M.W., H.L.v.W.; statistical analysis, J.B.R.; manuscript editing, all authors
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
T. J. Kim, H. Y. Kim, K. W. Lee, and M. S. Kim Multimodality Assessment of Esophageal Cancer: Preoperative Staging and Monitoring of Response to Therapy RadioGraphics, March 1, 2009; 29(2): 403 - 421. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ben-Haim and P. Ell 18F-FDG PET and PET/CT in the Evaluation of Cancer Treatment Response J. Nucl. Med., January 1, 2009; 50(1): 88 - 99. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. I. Khushalani Cancer of the Esophagus and Stomach Mayo Clin. Proc., June 1, 2008; 83(6): 712 - 722. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. Bruzzi, R. F. Munden, M. T. Truong, E. M. Marom, B. S. Sabloff, G. W. Gladish, R. B. Iyer, T.-S. Pan, H. A. Macapinlac, and J. J. Erasmus PET/CT of Esophageal Cancer: Its Role in Clinical Management RadioGraphics, November 1, 2007; 27(6): 1635 - 1652. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bedenne and F. Bonnetain In Reply J. Clin. Oncol., September 1, 2007; 25(25): 4026 - 4027. [Full Text] [PDF] |
||||
![]() |
G. Piessen, N. Briez, J.-P. Triboulet, and C. Mariette Patients with Locally Advanced Esophageal Carcinoma Nonresponder to Radiochemotherapy: Who Will Benefit From Surgery? Ann. Surg. Oncol., July 1, 2007; 14(7): 2036 - 2044. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Weber and R. Figlin Monitoring Cancer Treatment with PET/CT: Does It Make a Difference? J. Nucl. Med., January 1, 2007; 48(1_suppl): 36S - 44S. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Apisarnthanarax, M. M. Alauddin, F. Mourtada, H. Ariga, U. Raju, O. Mawlawi, D. Han, W. G. Bornmann, J. A. Ajani, L. Milas, et al. Early Detection of Chemoradioresponse in Esophageal Carcinoma by 3'-Deoxy-3'-3H-Fluorothymidine Using Preclinical Tumor Models Clin. Cancer Res., August 1, 2006; 12(15): 4590 - 4597. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |