Radiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


DOI: 10.1148/radiol.2321031974
This Article
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mak, H. K. F.
Right arrow Articles by Chan, B. P. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Mak, H. K. F.
Right arrow Articles by Chan, B. P. L.
(Radiology 2004;232:302-303.)


Letters to the Editor

Prevalence-adjusted Bias-adjusted {kappa} Values as Additional Indicators to Measure Observer Agreement [letter]

Henry K. F. Mak, FRCR,*

Department of Diagnostic Radiology, Yan Chai Hospital, 7–11 Yan Chai Street, Tsuen Wan, Hong Kong Special Administrative Region, China*

Kelvin K. W. Yau, PhD,{dagger}

Department of Management Sciences, City University of Hong Kong, Hong Kong Special Administrative Region, China{dagger}

Bernard P. L. Chan, MRCP{ddagger}

Department of Medicine, National University Hospital, Singapore{ddagger}. e-mail: kfmakhk@netvigator.com

Editor:

We read with much interest the article by Drs Kundel and Polansky in the August 2003 issue of Radiology (1), in which the {kappa} statistic was used to measure observer agreement in various screening and diagnostic imaging studies. As highlighted, in the presence of low disease prevalence, the {kappa} statistic should be interpreted with caution (2), and indexes of positive and negative agreement should also be reported (3). In addition, readers should be aware of the effect that bias between observers has on {kappa} values, where bias effect refers to the difference in observer assessment of the frequency occurrence, and prevalence effect refers to the difference between the probability of "positive" versus "negative" categories (4). In general, {kappa} values increase with an increased bias, and low prevalence results in a decrease in {kappa} values. In most cases, bias is not a major problem. However, the low prevalence level results in a substantial reduction in {kappa} values, which can be misleading.

Specific concern over the use of {kappa} values lies in the maximum attainable value, and hence, the underestimation of the strength of agreement for various ranges of {kappa} values according to Landis and Koch (5). As an illustration, consider the Table, which is a hypothetical 2 x 2 table.


View this table:
[in this window]
[in a new window]

 
2 x 2 Table

 
By assuming a disease prevalence of 0.2, the maximum attainable {kappa} value is found to be 0.375, so no matter how good the agreement between the observers, the strength of agreement is always "fair" according to the classification of Landis and Koch (5).

Our recent work in a computed tomographic study (6) reveals a similar problem caused by low disease prevalence when measuring rater agreement. The use of prevalence-adjusted bias-adjusted {kappa} values could be useful additionalindicators for measuring observer agreement (4). This is an adjusted measure that aims to alleviate the effect of bias and prevalence on {kappa} values. In particular, this method can be shown to relate to the observed proportion of agreement with the maximum attainable value always equal to +1. The adjusted {kappa} value (prevalence-adjusted bias-adjusted {kappa} value) in the previous example now becomes 0.6, which indicates "moderate" agreement according to the Landis and Koch classification (5).

Our experience has shown that the use of prevalence-adjusted bias-adjusted {kappa} values as additional agreement measures is especially important for the screening and diagnostic studies in the community setting, when a control for prevalence cannot be achieved easily. This is well exemplified by a previous study (7) on the interobserver reliability of chest radiographs in community-acquired pneumonia. The authors asserted that the {kappa} statistics in their study were lowered artifactually because of the presence of either high-prevalence findings (such as presence of infiltrate, alveolar vs interstitial) or low-prevalence findings (such as adenopathy or postobstructive pneumonia). This penalty imposed by the {kappa} coefficient because of an "unbalanced" challenge population was discussed thoroughly by Cicchetti and Feinstein (3).

We appreciate the authors’ efforts in reviewing this very important and useful statistical concept in diagnostic imaging studies.

REFERENCES

  1. Kundel HL, Polansky M. Measurement of observer agreement. Radiology 2003; 228:303-308.[Abstract/Free Full Text]
  2. Feinstein AR, Cicchetti DV. High agreement but low kappa. I. The problem of two paradoxes. J Clin Epidemiol 1990; 43:543-549.[CrossRef][Medline]
  3. Cicchetti DV, Feinstein AR. High agreement but low kappa. II. Resolving the paradoxes. J Clin Epidemiol 1990; 43:551-558.[CrossRef][Medline]
  4. Byrt T, Bishop J, Carlin JB. Bias, prevalence and kappa. J Clin Epidemiol 1993; 46:423-429.[CrossRef][Medline]
  5. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977; 33:159-174.[CrossRef][Medline]
  6. Mak HK, Yau KK, Khong PL, et al. Hypodensity of >1/3 middle cerebral artery territory versus Alberta Stroke Programme Early CT Score (ASPECTS): comparison of two methods of quantitative evaluation of early CT changes in hyperacute ischemic stroke in the community setting. Stroke 2003; 34:1194-1196.[Abstract/Free Full Text]
  7. Albaum MN, Hill LC, Murphy M, et al. Interobserver reliability of the chest radiograph in community-acquired pneumonia. Chest 1996; 110:343-350.[Abstract/Free Full Text]




This Article
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mak, H. K. F.
Right arrow Articles by Chan, B. P. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Mak, H. K. F.
Right arrow Articles by Chan, B. P. L.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
RADIOLOGY RADIOGRAPHICS RSNA JOURNALS ONLINE