|
|
||||||||
Statistical Concepts Series |
1 From the Department of Radiology, Brigham and Womens Hospital (K.H.Z., K.T., S.G.S.) and Department of Health Care Policy (K.H.Z.), Harvard Medical School, 180 Longwood Ave, Boston, MA 02115. Received September 10, 2001; revision requested October 31; revision received December 26; accepted January 21, 2002. Address correspondence to K.H.Z. (e-mail: zou@bwh.harvard.edu).
| ABSTRACT |
|---|
|
|
|---|
, for measuring linear and nonlinear relationships between two continuous variables. In the case of measuring the linear relationship between a predictor and an outcome variable, simple linear regression analysis is conducted. These statistical concepts are illustrated by using a data set from published literature to assess a computed tomographyguided interventional technique. These statistical methods are important for exploring the relationships between variables and can be applied to many radiologic studies. © RSNA, 2003
Index terms: Data analysis Statistical analysis
| INTRODUCTION |
|---|
|
|
|---|
Analyses between two variables may focus on (a) any association between the variables, (b) the value of one variable in predicting the other, and (c) the amount of agreement. Agreement will be discussed in a future article. Regression analysis focuses on the form of the relationship between variables, while the objective of correlation analysis is to gain insight into the strength of the relationship (1,2). Note that these two techniques are used to investigate relationships between continuous variables, whereas the
2 test is an example of a test for association between categorical variables. Continuous variables, such as procedure time, patient age, and number of lesions, have no gaps on the measurement scale. In contrast, categorical variables, such as patient sex and tissue classification based on segmentation, have gaps in their possible values. These two types of variables and the assumptions about their measurement scales were reviewed and distinguished in an article by Applegate and Crewson (3) published earlier in this Statistical Concepts Series in Radiology.
Specifically, the topics covered herein include two commonly used correlation coefficients, the Pearson correlation coefficient (4,5) and the Spearman
(610) for measuring linear and nonlinear relationship, respectively, between two continuous variables. Correlation analysis is often conducted in a retrospective or observational study. In a clinical trial, on the other hand, the investigator may also wish to manipulate the values of one variable and assess the changes in values of another variable. To evaluate the relative impact of the predictor variable on the particular outcome, simple regression analysis is preferred. We illustrate these statistical concepts with existing data to assess patient skin dose based on total procedure time by using a quick-check method in CT fluoroscopyguided abdominal interventions (11).
These statistical methods are useful tools for assessing the relationships between continuous variables collected from a clinical study. However, it is also important to distinguish these statistical methods. While they are similar mathematically, their purposes are different. Correlation analysis is generally overused. It is often interpreted incorrectly (to establish "causation") and should be reserved for generating hypotheses rather than for testing them. On the other hand, regression modeling is a more useful statistical technique that allows us to assess the strength of the relationships in the data and the uncertainty in the model by using confidence intervals (12,13).
| CORRELATION |
|---|
|
|
|---|
correlation coefficients. Both correlation coefficients take on values between -1 and +1, ranging from being negatively correlated (-1) to uncorrelated (0) to positively correlated (+1). The sign of the correlation coefficient (ie, positive or negative) defines the direction of the relationship. The absolute value indicates the strength of the correlation (Table 1, Fig 1). We elaborate on two correlation coefficients, linear (eg, Pearson) and rank (eg, Spearman), that are commonly used for measuring linear and general relationships between two variables.
|
|
Rank Correlation
The Spearman
is the sample correlation coefficient (rs) of the ranks (the relative order) based on continuous data (19,20). It was first introduced by Spearman in 1904 (6). The Spearman
is used to measure the monotonic relationship between two variables (ie, whether one variable tends to take either a larger or smaller value, though not necessarily linearly) by increasing the value of the other variable.
Linear versus Rank Correlation Coefficients
The Pearson correlation coefficient necessitates use of interval or continuous measurement scales of the measured outcome in the study population. In contrast, rank correlations also work well with ordinal rating data, and continuous data are reduced to their ranks (Appendix C) (20,21). The rank procedure will also be illustrated briefly with our example data. The smallest value in the sample has rank 1, and the largest has the highest rank. In general, rank correlations are not easily influenced by the presence of skewed data or data that are highly variable.
Statistical Hypothesis Tests for a Correlation Coefficient
The null hypothesis states that the underlying linear correlation has a hypothesized value,
0. The one-sided alternative hypothesis is that the underlying value exceeds (or is less than)
0. When the sample size (n) of the paired data is large (n
30 for each variable), the standard error (s) of the linear correlation (r) is approximately s(r) = (1 - r2)/
n. The test statistic value (r -
0)/s(r) may be computed by means of the z test (22). If the P value is below .05, the null hypothesis is rejected. The P value based on the Spearman
can be found in the literature (20,21).
Limitations and Precautions
It is worth noting that even if two variables (eg, cigarette smoking and lung cancer) are highly correlated, it is not sufficient proof of causation. One variable may cause the other or vice versa, or a third factor is involved, or a rare event may have occurred. To conclude causation, the causal variables must precede the variable it causes, and several conditions must be met (eg, reversibility, strength, and exposure response on the basis of the Bradford-Hill criteria or the Rubin causal model) (2326).
| SIMPLE LINEAR REGRESSION |
|---|
|
|
|---|
A simple regression model contains only one independent (explanatory) variable, Xi, for i = 1, ..., n subjects, and is linear with respect to both the regression parameters and the dependent variable. The corresponding dependent (outcome) variable is labeled. The model is expressed as
|
|
|
Least Squares Method
The main goal of linear regression is to fit a straight line through the data that predicts Y based on X. To estimate the intercept and slope regression parameters that determine this line, the least squares method is commonly used. It is not necessary for the errors to have a normal distribution, although the regression analysis is more efficient with this assumption (27). With this regression method, a set of regression parameters are found such that the sum of squared residuals (ie, the differences between the observed values of the outcome variable and the fitted values) are minimized (14). The fitted y value is then computed as a function of the given x value and the estimated intercept and slope regression parameter (Appendix D). For example, in Equation (1), once the estimates of a and b are obtained from the regression analysis, the predicted y value at any given x value is calculated as a + bx.
Coefficient of Determination, R2
It is meaningful to interpret the value of the Pearson correlation coefficient r by squaring it; hence, the term R-square (R2) or coefficient of determination. This measure (with a range of 01) is the fraction of the variability in Y that can be explained by the variability in X through their linear relationship, or vice versa. That is, R2 = SSregression/SStotal, where SS stands for the sum of squares. Note that R2 is calculated only on the basis of the Pearson correlation coefficient in the linear regression analysis. Thus, it is not appropriate to compute R2 on the basis of rank correlation coefficients such as the Spearman
.
Statistical Hypothesis Tests
There are several hypotheses in the context of regression analysis, for example, to test if the slope of the regression line is b = 0 (hypothesis, there is no linear association between Y and X). One may also test whether intercept a takes on a certain value. The significance of the effects of the intercept and slope may also be computed by means of a Student t statistic introduced earlier in this Statistical Concepts Series in Radiology (30).
Limitations and Precautions
The following understandings should be considered when regression analysis is performed. (a) To understand whether the assumptions have been met, determine the magnitude of the gap between the data and the assumptions of the model. (b) No matter how strong a relationship is demonstrated with regression analysis, it should not be interpreted as causation (as in the correlation analysis). (c) The regression should not be used to predict or estimate outside the range of values of the independent variable of the sample (eg, extrapolation of radiation cancer risk from the Hiroshima data to that of diagnostic radiologic tests).
| AN EXAMPLE: DOSE VERSUS TOTAL PROCEDURE TIME IN CT FLUOROSCOPY |
|---|
|
|
|---|
Earlier, we discussed two methods to target lesions with CT fluoroscopy. In one method, continuous CT scanning is used during needle placement. In the other method, short CT scanning is used to image the needle after it is placed. The latter method, the so-called quick-check method, has been adopted almost exclusively at our institution. Now, we demonstrate correlation and regression analyses based on a subset of the interventional procedures (n = 19). With the quick-check method, we examine the relationship between total procedure time (in minutes) and dose (in rads) on a natural log scale. We also examine the marginal ranks of the x (log of total time) and y (log of dose) components (Table 2). For convenience, the x data are given in ascending order.
|
The natural log (ln) transformation of the total time is used to make the data appear normal, for more efficient analysis (Appendix D), with normality verified statistically (31). However, normality is not necessary in the subsequent regression analysis. We created a scatterplot of the data, with the log of dose (ln[rad]) on the x axis and the log of total time (ln[minutes]) on the y axis (Fig 3).
|
Correlations
To compute the Spearman
with a Pearson correlation coefficient of r = 0.85, the marginal ranks of time and dose were derived separately; consequently, rs = 0.84. Both correlation coefficients confirm that the log of total time and the log of dose are correlated strongly and positively.
Regression
We first conducted a simple linear regression analysis of the data on a log scale (n = 19); results are shown in Table 3. The value calculated for R2 was 0.73, which suggests that 73% of the variability of the data could be explained by the linear regression.
|
|
| SUMMARY AND REMARKS |
|---|
|
|
|---|
When correlation analysis is conducted to measure the association between two random variables, either the Pearson linear correlation coefficient or the Spearman rank correlation coefficient
may be adopted. The former coefficient is used to measure the linear relationship but is not recommended for use with skewed data or data with extremely large or small values (often called the outliers). In contrast, the latter coefficient is used to measures a general association, and it is recommended for use with data that are skewed or that have outliers.
When simple regression analysis is conducted to assess the linear relationship of a dependent variable as a function of the independent variable, caution must be used when determining which of the two variables is viewed as the independent variable that makes sense clinically. A useful graphical aid is a scatterplot. Once the regression line is obtained, caution should also be used to avoid prediction of a y value for any value of x that is outside the range of the data. Finally, correlation and regression analyses do not infer causality, and more rigorous analyses are required if causal inference is to be made (2326).
| APPENDIX A |
|---|
|
|
|---|
|
and
are the sample means of the Xi and Yi values, respectively. The Pearson correlation coefficient may be computed by means of a computer-based statistics program (Excel; Microsoft) by using the option "Correlation" under the option "Data Analysis Tools". Alternatively, it may also be computed by means of a built-in software function "Cor" (Insightful; MathSoft, Seattle, Wash [MathSoft S-Plus 4 guide to statistics, 1997; 8996]. Available at: www.insightful.com) or with a free software program (R Software. Available at: lib.stat.cmu.edu/R).
| APPENDIX B |
|---|
|
|
|---|
= type I error (ie, significance level, typically fixed at 0.05), ß = type II error (ie, 1 minus statistical power, typically fixed at 0.10). Then N = [(Z
+ Zß)/C]2 + 3, where Z
is the inverse of the cumulative probability of a standard normal distribution with the tail probability of
. Similarly, Zß is the inverse of the cumulative probability of a standard normal distribution with the tail probability of ß. Consequently, compute the smallest integer, n, such that n
N, as the required sample size.
For example, an investigator wishes to conduct a clinical trial of a paired design based on a one-tailed hypothesis test of the correlation coefficient. The null hypothesis is that the correlation between two variables is r = 0.60 (ie, C = 0.693) in the population of interest. The alternative hypothesis is that the correlation is r > 0.60. Type I error is fixed to be 0.05 (ie, Z
= 1.645), while type II error is fixed to be 0.10 (ie, Zß = 1.282). Thus, the required sample size is N = 21 subjects. A sample size table may also be found in reference 18.
| APPENDIX C |
|---|
|
|
|---|
and Pearson rs: Replace bivariate data, Xi and Yi (i = 1,...,n), by their respective ranks Ri = rank(Xi) and Si = rank(Yi). Rank correlation coefficient, rs, is defined as the Pearson correlation coefficient between the Ri and Si values, which can be computed by means of the formula given in Appendix A. An alternative direct formula was given by Hettmansperger (19).
The Spearman
may also be computed by first reducing the continuous data to their marginal ranks by using the "rank and percentile" option with Data Analysis Tools (Excel; Microsoft) or the "rank" function (Insightful; MathSoft) or the free software. Both software programs correctly rank the data in ascending order. However, the rank and percentile option in Excel ranks the data in descending order (the largest is 1). Therefore, to compute the correct ranks, one may first multiply all of the data by -1 and then apply the rank function. Excel also gives integer ranks in the presence of ties compared with the methods that yield possible noninteger ranks, as described in the standard statistics literature (19).
Subsequently, the sample correlation coefficient is computed on the basis of the ranks of the two marginal data by using the Correlation option in Data Analysis Tools (Excel; Microsoft) or by using the Cor function (Insightful; MathSoft) or the free software.
| APPENDIX D |
|---|
|
|
|---|
With either program, one may choose to transform the data or exclude outliers before conducting a simple regression analysis. A commonly used variance-stabilizing transformation is the natural log function (ln) applied to one or both variables. Other transformation (eg, Box-Cox transformation) and weighting methods in regression analysis may also be used (28,29).
| APPENDIX E |
|---|
|
|
|---|
| APPENDIX F |
|---|
|
|
|---|
Bivariate data.Measurements obtained on more than one variable for the same unit or subject.
Correlation coefficient.A statistic between -1 and 1 that measures the association between two variables.
Intercept.The constant a in the regression equation, which is the value for y when x = 0.
Least squares method.The regression line that is the best fit to the data for which the sum of the squared residuals is minimized.
Outlier.An extreme observation far away from the bulk of the data, often caused by faulty measuring equipment or recording error.
Pearson correlation coefficient.Sample correlation coefficient for measuring the linear relationship between two variables.
R2.The square of the Pearson correlation coefficient r, which is the fraction of the variability in Y that can be explained by the variability in X through their linear relationship or vice versa.
Rank.The relative ordering of the measurements in a variable, which can be noninteger numbers in the presence of ties.
Residual.The difference between the observed values of the outcome variable and the fitted values based on a linear regression analysis.
Scatterplot.A plot of the observed bivariate outcome variable (y axis) against its predictor variable (x axis), with a dot for each pair of bivariate observations.
Simple linear regression analysis.A linear regression analysis with one predictor and one outcome variable.
Skewed data.A distribution is skewed if there are more extreme data on one side of the mean. Otherwise, the distribution is symmetric.
Slope.The constant b in the regression equation, which is the change in y that corresponds to a one-unit increase (or decrease) in x.
Spearman
.A rank correlation coefficient for measuring the monotone relationship between two variables.
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
N. Salameh, B. Larrat, J. Abarca-Quinones, S. Pallu, M. Dorvillius, I. Leclercq, M. Fink, R. Sinkus, and B. E. Van Beers Early Detection of Steatohepatitis in Fatty Rat Liver by Using MR Elastography Radiology, July 8, 2009; (2009) 2523081817. [Abstract] [Full Text] |
||||
![]() |
M. R. Oliva, J. N. Glickman, K. H. Zou, S. Y. Teo, K. J. Mortele, M. S. Rocha, and S. G. Silverman Renal Cell Carcinoma: T1 and T2 Signal Intensity Characteristics of Papillary and Clear Cell Types Correlated with Pathology Am. J. Roentgenol., June 1, 2009; 192(6): 1524 - 1530. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Horsthuis, C. Lavini, S. Bipat, P. C. F. Stokkers, and J. Stoker Perianal Crohn Disease: Evaluation of Dynamic Contrast-enhanced MR Imaging as an Indicator of Disease Activity Radiology, May 1, 2009; 251(2): 380 - 387. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Stadlbauer, E. Salomonowitz, G. Strunk, T. Hammen, and O. Ganslandt Age-related Degradation in the Central Nervous System: Assessment with Diffusion-Tensor Imaging and Quantitative Fiber Tracking Radiology, April 1, 2008; 247(1): 179 - 188. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Xiang, R. M. Minter, X. Bi, P. J. Woolf, and Y. He miniTUBA: medical inference by network integration of temporal data using Bayesian analysis Bioinformatics, September 15, 2007; 23(18): 2423 - 2432. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Haider, S. R. H. Davidson, A. V. Kale, R. A. Weersink, A. J. Evans, A. Toi, M. R. Gertner, A. Bogaards, B. C. Wilson, J. L. Chin, et al. Prostate Gland: MR Imaging Appearance after Vascular Targeted Photodynamic Therapy with Palladium-Bacteriopheophorbide Radiology, July 1, 2007; 244(1): 196 - 204. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Stadlbauer, C. Nimsky, S. Gruber, E. Moser, T. Hammen, T. Engelhorn, M. Buchfelder, and O. Ganslandt Changes in Fiber Integrity, Diffusivity, and Metabolism of the Pyramidal Tract Adjacent to Gliomas: A Quantitative Diffusion Tensor Fiber Tracking and MR Spectroscopic Imaging Study AJNR Am. J. Neuroradiol., March 1, 2007; 28(3): 462 - 469. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Stadlbauer, O. Ganslandt, R. Buslei, T. Hammen, S. Gruber, E. Moser, M. Buchfelder, E. Salomonowitz, and C. Nimsky Gliomas: Histopathologic Evaluation of Changes in Directionality and Magnitude of Water Diffusion at Diffusion-Tensor MR Imaging Radiology, September 1, 2006; 240(3): 803 - 810. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Weber, S. Zoubaa, M. Schlieter, E. Juttler, H. B. Huttner, K. Geletneky, C. Ittrich, M. P. Lichy, A. Kroll, J. Debus, et al. Diagnostic performance of spectroscopic and perfusion MRI for distinction of brain tumors Neurology, June 27, 2006; 66(12): 1899 - 1906. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-A. Weber, H. Krakowski-Roosen, S. Delorme, H. Renk, M. Krix, J. Millies, R. Kinscherf, A. Kunkele, H.-U. Kauczor, and W. Hildebrandt Relationship of skeletal muscle perfusion measured by contrast-enhanced ultrasonography to histologic microvascular density. J. Ultrasound Med., May 1, 2006; 25(5): 583 - 591. [Abstract] [Full Text] [PDF] |
||||
![]() |
I.-F. Talos, K. H. Zou, L. Ohno-Machado, J. G. Bhagwat, R. Kikinis, P. M. Black, and F. A. Jolesz Supratentorial Low-Grade Glioma Resectability: Statistical Predictive Analysis Based on Anatomic MR Features and Tumor Characteristics. Radiology, May 1, 2006; 239(2): 506 - 513. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Florie, M. N. J. M. Wasser, K. Arts-Cieslik, E. M. Akkerman, P. D. Siersema, and J. Stoker Dynamic Contrast-Enhanced MRI of the Bowel Wall for Assessment of Disease Activity in Crohn's Disease. Am. J. Roentgenol., May 1, 2006; 186(5): 1384 - 1392. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. H. Zou, D. N. Greve, M. Wang, S. D. Pieper, S. K. Warfield, N. S. White, S. Manandhar, G. G. Brown, M. G. Vangel, R. Kikinis, et al. Reproducibility of Functional MR Imaging: Preliminary Results of Prospective Multi-institutional Study Performed by Biomedical Informatics Research Network Radiology, December 1, 2005; 237(3): 781 - 789. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. S. Stearman, L. Dwyer-Nield, L. Zerbe, S. A. Blaine, Z. Chan, P. A. Bunn Jr., G. L. Johnson, F. R. Hirsch, D. T. Merrick, W. A. Franklin, et al. Analysis of Orthologous Gene Expression between Human Pulmonary Adenocarcinoma and a Carcinogen-Induced Murine Model Am. J. Pathol., December 1, 2005; 167(6): 1763 - 1775. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Andreisek, M. Kilgus, D. Burg, N. Saupe, D. W. Crook, V. Meyer, B. Marincek, and D. Weishaupt MRI of the Intrinsic Muscles of the Hand: Spectrum of Imaging Findings and Clinical Correlation Am. J. Roentgenol., October 1, 2005; 185(4): 930 - 939. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Goldberg-Zimring, B. Shalmon, K. H. Zou, H. Azhari, D. Nass, and A. Achiron Assessment of Multiple Sclerosis Lesions with Spherical Harmonics: Comparison of MR Imaging and Pathologic Findings Radiology, June 1, 2005; 235(3): 1036 - 1044. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Bezerra, G. D'Ippolito, S. Faintuch, J. Szejnfeld, and M. Ahmed Determination of Splenomegaly by CT: Is There a Place for a Single Measurement? Am. J. Roentgenol., May 1, 2005; 184(5): 1510 - 1513. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Memarsadeghi, G. Heinz-Peer, T. H. Helbich, C. Schaefer-Prokop, G. Kramer, M. Scharitzer, and M. Prokop Unenhanced Multi-Detector Row CT in Patients Suspected of Having Urinary Stone Disease: Effect of Section Width on Diagnosis Radiology, May 1, 2005; 235(2): 530 - 536. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. J. Levy, R. M. Chiavacci, S. C. Nicolson, J. J. Rome, R. J. Lin, M. A. Helfaer, and V. M. Nadkarni An Evaluation of a Noninvasive Cardiac Output Measurement Using Partial Carbon Dioxide Rebreathing in Children Anesth. Analg., December 1, 2004; 99(6): 1642 - 1647. [Abstract] [Full Text] [PDF] |
||||
![]() |
I-P. Hsueh, W.-C. Wang, C.-F. Sheu, and C.-L. Hsieh Rasch Analysis of Combining Two Indices to Assess Comprehensive ADL Function in Stroke Patients Stroke, March 1, 2004; 35(3): 721 - 726. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. F. Gareen and C. Gatsonis Primer on Multiple Regression Models for Diagnostic Imaging Research Radiology, November 1, 2003; 229(2): 305 - 310. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |