Interobserver agreement issues in radiology

M Benchoufi, E Matzner-Lober, N Molinari… - Diagnostic and …, 2020 - Elsevier
Abstract
Agreement between observers (i.e., inter-rater agreement) can be quantified with various criteria, but their appropriate selection is critical. When the measure is qualitative (nominal or ordinal), the proportion of agreement or the kappa coefficient should be used to evaluate inter-rater consistency (i.e., inter-rater reliability). The kappa coefficient is more meaningful than the raw percentage of agreement, because the latter does not account for agreements due to chance alone. When the measures are quantitative, the intraclass correlation coefficient (ICC) should be used to assess agreement, but this should be done with care because there are different ICCs, so it is important to describe the model and type of ICC being used. The Bland-Altman method can be used to assess consistency and conformity, but its use should be restricted to the comparison of two raters.
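A minimal sketch (not from the article, using hypothetical data) of two of the quantities discussed above: the chance-corrected kappa coefficient versus the raw proportion of agreement for two raters on a nominal scale, and Bland-Altman bias with 95% limits of agreement for two raters producing quantitative measurements.

import numpy as np
from sklearn.metrics import cohen_kappa_score

# Hypothetical example: two raters classifying 10 lesions as benign (0) or malignant (1)
rater_a = np.array([0, 1, 1, 0, 1, 0, 0, 1, 1, 0])
rater_b = np.array([0, 1, 0, 0, 1, 0, 1, 1, 1, 0])

raw_agreement = np.mean(rater_a == rater_b)   # proportion of agreement; does not correct for chance
kappa = cohen_kappa_score(rater_a, rater_b)   # chance-corrected agreement
print(f"raw agreement = {raw_agreement:.2f}, kappa = {kappa:.2f}")

# Hypothetical example: Bland-Altman bias and 95% limits of agreement
# for two raters measuring the same quantity (lesion diameter, mm)
meas_a = np.array([12.1, 15.3, 9.8, 22.0, 17.5, 11.2])
meas_b = np.array([11.8, 16.0, 10.1, 21.2, 18.1, 11.0])
diff = meas_a - meas_b
bias = diff.mean()                            # mean difference between raters
loa = 1.96 * diff.std(ddof=1)                 # half-width of the 95% limits of agreement
print(f"bias = {bias:.2f} mm, limits of agreement = [{bias - loa:.2f}, {bias + loa:.2f}] mm")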