R2: a useful measure of model performance when predicting a dichotomous outcome
Department of Quantitative Health Sciences
*Hospital Mortality; Humans; Mathematical Computing; *Models, Statistical; Odds Ratio; Predictive Value of Tests; ROC Curve; Risk Assessment; Severity of Illness Index
Biostatistics | Epidemiology | Health Services Research
R2 has been criticized as a measure of model performance when predicting a dichotomous outcome, both because its value is often low and because it is sensitive to the prevalence of the event of interest. The C statistic is more widely used to measure model performance in a 0/1 setting. We use a simple parametric family of models to illustrate the potential usefulness of models with low R2 values, to clarify the effect of prevalence on both C and R2, and to demonstrate how R2 captures information not picked up by C. We also show that C is subject to a 'random mixing' problem that does not affect R2. Finally, we report both R2 and C values for different risk-adjustment models in situations with different prevalences and show the relationship between the measures and decile death rates, thereby providing a context for interpreting R2 values in a 0/1 setting.
Stat Med. 1999 Feb 28;18(4):375-84. Link to article on publisher's site
Statistics in medicine
Ash AS, Shwartz M. (1999). R2: a useful measure of model performance when predicting a dichotomous outcome. Population and Quantitative Health Sciences Publications. Retrieved from https://escholarship.umassmed.edu/qhs_pp/684