|
|
||||||||
Perspectives |
J Sim, PhD, is Professor, Primary Care Sciences Research Centre, Keele University, Keele, Staffordshire ST5 5BG, United Kingdom (j.sim{at}keele.ac.uk)
CC Wright, BSc, is Principal Lecturer, School of Health and Social Sciences, Coventry University, Coventry, United Kingdom
Address all correspondence to Dr Sim
Purpose. This article examines and illustrates the use and interpretation of the kappa statistic in musculoskeletal research. Summary of Key Points. The reliability of clinicians' ratings is an important consideration in areas such as diagnosis and the interpretation of examination findings. Often, these ratings lie on a nominal or an ordinal scale. For such data, the kappa coefficient is an appropriate measure of reliability. Kappa is defined, in both weighted and unweighted forms, and its use is illustrated with examples from musculoskeletal research. Factors that can influence the magnitude of kappa (prevalence, bias, and nonindependent ratings) are discussed, and ways of evaluating the magnitude of an obtained kappa are considered. The issue of statistical testing of kappa is considered, including the use of confidence intervals, and appropriate sample sizes for reliability studies using kappa are tabulated. Conclusions. The article concludes with recommendations for the use and interpretation of kappa.
Key Words: Kappa Measurement Reliability Sample size
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
J. Barber, S. Muller, T. Whitehurst, and E. Hay Measuring morbidity: self-report or health care records? Fam. Pract., February 1, 2010; 27(1): 25 - 30. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Opriessnig, J. S. Bender, and P. G. Halbur Development and validation of an immunohistochemical method for rapid diagnosis of swine erysipelas in formalin-fixed, paraffin-embedded tissue samples J Vet Diagn Invest, January 1, 2010; 22(1): 86 - 90. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Clarkson, M. Abendstern, C. Sutcliffe, J. Hughes, and D. Challis Reliability of needs assessments in the community care of older people: impact of the single assessment process in England J Public Health, December 1, 2009; 31(4): 521 - 529. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Kristensen, L. Andersen, R. Bech-Jensen, M. Moos, B. Hovmand, C. Ekdahl, and H. Kehlet High intertester reliability of the Cumulated Ambulation Score for the evaluation of basic mobility in patients with hip fracture Clinical Rehabilitation, December 1, 2009; 23(12): 1116 - 1123. [Abstract] [PDF] |
||||
![]() |
H. L. Fisher, T. K. Craig, P. Fearon, K. Morgan, P. Dazzan, J. Lappin, G. Hutchinson, G. A. Doody, P. B. Jones, P. McGuffin, et al. Reliability and Comparability of Psychosis Patients' Retrospective Reports of Childhood Abuse Schizophr Bull, October 7, 2009; (2009) sbp103v2. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-J. Juan, H.-C. Chang, C.-J. Hsueh, H.-S. Liu, Y.-C. Huang, H.-W. Chung, C.-Y. Chen, H.-W. Kao, and G.-S. Huang Salivary Glands: Echo-Planar versus PROPELLER Diffusion-weighted MR Imaging for Assessment of ADCs Radiology, October 1, 2009; 253(1): 144 - 152. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-H. Li, Y.-S. Cheng, Y.-D. Li, C. Fang, S.-W. Chen, W. Wang, D.-J. Hu, and H.-W. Xu Large-Cohort Comparison Between Three-Dimensional Time-of-Flight Magnetic Resonance and Rotational Digital Subtraction Angiographies in Intracranial Aneurysm Detection Stroke, September 1, 2009; 40(9): 3127 - 3129. [Abstract] [Full Text] [PDF] |
||||
![]() |
M N Storm-Versloot, D T Ubbink, V Chin a Choi, and J S K Luitse Observer agreement of the Manchester Triage System and the Emergency Severity Index: a simulation study Emerg. Med. J., August 1, 2009; 26(8): 556 - 560. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Vasudevan, C. J. Etzel, M. R. Spitz, and A. V. Wilkinson Maternal current smoking: Concordance between adolescent proxy and mother's self-report Nicotine Tob Res, August 1, 2009; 11(8): 1016 - 1019. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Roddy, S. Muller, and E. Thomas Defining disabling foot pain in older adults: further examination of the Manchester Foot Pain and Disability Index Rheumatology, August 1, 2009; 48(8): 992 - 996. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. S Freedman, P. T Katzmarzyk, W. H Dietz, S. R Srinivasan, and G. S Berenson Relation of body mass index and skinfold thicknesses to cardiovascular disease risk factors in children: the Bogalusa Heart Study Am. J. Clinical Nutrition, July 1, 2009; 90(1): 210 - 216. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R. Bakic, A.-K. Carton, D. Kontos, C. Zhang, A. B. Troxel, and A. D. A. Maidment Breast Percent Density: Estimation on Digital Mammograms and Central Tomosynthesis Projections1 Radiology, July 1, 2009; 252(1): 40 - 49. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. H. M. Antommaria, K. Trotochaud, K. Kinlaw, P. N. Hopkins, and J. Frader Policies on Donation After Cardiac Death at Children's Hospitals: A Mixed-Methods Analysis of Variation JAMA, May 13, 2009; 301(18): 1902 - 1908. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. R. Bakic, A.-K. Carton, D. Kontos, C. Zhang, A. B. Troxel, and A. D. A. Maidment Breast Percent Density: Estimation on Digital Mammograms and Central Tomosynthesis Projections Radiology, May 6, 2009; (2009) 2521081621. [Abstract] [Full Text] |
||||
![]() |
P. J. Karanicolas, M. Bhandari, H. Kreder, A. Moroni, M. Richardson, S. D. Walter, G. R. Norman, G. H. Guyatt, and on Behalf of the Collaboration for Outcome Assessm Evaluating Agreement: Conducting a Reliability Study J. Bone Joint Surg. Am., May 1, 2009; 91(Supplement_3): 99 - 106. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Large and O. Nielssen Factors Associated With Agreement Between Experts in Evidence About Psychiatric Injury J Am Acad Psychiatry Law, December 1, 2008; 36(4): 515 - 521. [Abstract] [Full Text] [PDF] |
||||
![]() |
N K S Wong, F Y Ng, and G Leung Cytological distinction between high-risk and low-risk human papillomavirus infections in SurePath liquid-based cell preparations J. Clin. Pathol., December 1, 2008; 61(12): 1317 - 1322. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. McLawsen, R. L. Jackson, S. D. Vannoy, G. J. Gagliardi, and M. J. Scalora Professional Perspectives on Sexual Sadism Sexual Abuse: A Journal of Research and Treatment, September 1, 2008; 20(3): 272 - 304. [Abstract] [PDF] |
||||
![]() |
C. A. Wyse, K. A. McNie, V. J. Tannahil, J. K. Murray, and S. Love Prevalence of obesity in riding horses in Scotland Vet Rec., May 3, 2008; 162(18): 590 - 591. [Full Text] [PDF] |
||||
![]() |
C. K. Yiannakopoulos, A. Chougle, A. Eskelinen, J. P. Hodgkinson, and G. Hartofilakidis Inter- and intra-observer variability of the Crowe and Hartofilakidis classification systems for congenital hip disease in adults J Bone Joint Surg Br, May 1, 2008; 90-B(5): 579 - 583. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Ruutu, G. Barosi, R. J. Benjamin, R. E. Clark, J. N. George, A. Gratwohl, E. Holler, M. Iacobelli, K. Kentouche, B. Lammle, et al. Diagnostic criteria for hematopoietic stem cell transplant-associated microangiopathy: results of a consensus process by an International Working Group Haematologica, January 1, 2007; 92(1): 95 - 100. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Svensson and C. Hager-Ross Hand function in Charcot Marie Tooth: test retest reliability of some measurements Clinical Rehabilitation, October 1, 2006; 20(10): 896 - 908. [Abstract] [PDF] |
||||
![]() |
H.-H. Wang, H.-F. Liao, and C.-L. Hsieh Reliability, Sensitivity to Change, and Responsiveness of the Peabody Developmental Motor Scales-Second Edition for Children With Cerebral Palsy Physical Therapy, October 1, 2006; 86(10): 1351 - 1359. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Manini, S. B. Cook, T. VanArnam, M. Marko, and L. Ploutz-Snyder Evaluating Task Modification as an Objective Measure of Functional Limitation: Repeatability and Comparability J Gerontol A Biol Sci Med Sci, July 1, 2006; 61(7): 718 - 725. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Barosi, D. Bordessoule, J. Briere, F. Cervantes, J.-L. Demory, B. Dupriez, H. Gisslinger, M. Griesshammer, H. Hasselbalch, R. Kusec, et al. Response criteria for myelofibrosis with myeloid metaplasia: results of an initiative of the European Myelofibrosis Network (EUMNET) Blood, October 15, 2005; 106(8): 2849 - 2853. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |