Bennett, Alpert and Goldstein's S

Bennett, Alpert & Goldstein’s S is a statistical measure of inter-rater agreement. It was created by Bennett et al. in 1954.

Rationale for use

Bennett et al. suggested that inter-rater reliability adjusted for the proportion of rater agreement expected by chance is a better measure than simple agreement between raters. They proposed an index that adjusts the proportion of rater agreement according to the number of categories employed.

Mathematical formulation

The formula for S is

$$S = \frac{Q P_a - 1}{Q - 1}$$

where $Q$ is the number of categories and $P_a$ is the proportion of agreement between raters.
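As an illustration of the formula for S, the following minimal Python sketch computes the observed proportion of agreement between two raters and applies the adjustment for the number of categories. The function name `bennett_s`, its parameters, and the example ratings are hypothetical and chosen only for this illustration. The sketch assumes two raters who each rate the same items once.

```python
from typing import Hashable, Sequence


def bennett_s(ratings_a: Sequence[Hashable],
              ratings_b: Sequence[Hashable],
              num_categories: int) -> float:
    """Return S = (Q * P_a - 1) / (Q - 1) for two raters.

    ratings_a, ratings_b: category labels given by each rater to the same items.
    num_categories: Q, the number of categories available to the raters.
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("both raters must rate the same number of items")
    if len(ratings_a) == 0:
        raise ValueError("at least one rated item is required")
    if num_categories < 2:
        raise ValueError("Q must be at least 2")

    # P_a: proportion of items on which the two raters chose the same category.
    agreements = sum(a == b for a, b in zip(ratings_a, ratings_b))
    p_a = agreements / len(ratings_a)

    return (num_categories * p_a - 1) / (num_categories - 1)


# Made-up example: two categories, raters agree on 8 of 10 items (P_a = 0.8).
rater_1 = ["yes", "no", "yes", "yes", "no", "yes", "no", "yes", "yes", "no"]
rater_2 = ["yes", "no", "yes", "no", "no", "yes", "no", "yes", "no", "no"]
print(round(bennett_s(rater_1, rater_2, num_categories=2), 3))  # prints 0.6
```

In this sketch S equals 1 when the raters agree on every item and 0 when the proportion of agreement equals 1/Q.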

The variance of S is

$$\operatorname{Var}(S) = \left(\frac{Q}{Q-1}\right)^{2} \frac{P_a (1 - P_a)}{n - 1}$$

where $n$ is the number of rated items (the sample size).
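The variance can be sketched in Python in the same way. The function name `bennett_s_variance` and its parameters are hypothetical, and n is assumed to be the number of rated items. The numerator is written as P_a(1 - P_a), so the result cannot be negative.

```python
import math


def bennett_s_variance(p_a: float, num_categories: int, n: int) -> float:
    """Return Var(S) = (Q / (Q - 1))**2 * P_a * (1 - P_a) / (n - 1).

    p_a: observed proportion of agreement between raters.
    num_categories: Q, the number of categories.
    n: number of rated items (assumed meaning of n).
    """
    if not 0.0 <= p_a <= 1.0:
        raise ValueError("p_a must lie between 0 and 1")
    if num_categories < 2:
        raise ValueError("Q must be at least 2")
    if n < 2:
        raise ValueError("n must be at least 2")

    scale = (num_categories / (num_categories - 1)) ** 2
    return scale * p_a * (1.0 - p_a) / (n - 1)


# Made-up example: P_a = 0.8, Q = 2 categories, n = 10 rated items.
variance = bennett_s_variance(0.8, num_categories=2, n=10)
standard_error = math.sqrt(variance)  # square root of the variance
print(round(variance, 4), round(standard_error, 4))
```

The variance is zero when the raters agree on every item or on none, and it shrinks as the number of rated items grows.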

Notes

This statistic is also known as Guilford’s G. Guilford was the first person to use the approach extensively in the determination of inter-rater reliability.

References

  1. Bennett, EM; Alpert, R; Goldstein, AC (1954). "Communications through limited response questioning". Public Opinion Quarterly. 18 (3): 303–308. doi:10.1086/266520.
  2. Warrens, Matthijs J. (May 2012). "The effect of combining categories on Bennett, Alpert and Goldstein's S". Statistical Methodology. 9 (3): 341–352. doi:10.1016/j.stamet.2011.09.001. hdl:1887/18383.
  3. Holley, JW; Guilford, JP (1964). "A note on the G index of agreement". Educational and Psychological Measurement. 24 (4): 749–753. doi:10.1177/001316446402400402. S2CID 143846590.