Use of Confidence Intervals for Verification Statistics

 

Tressa L. Fowler and Barbara G. Brown

National Center for Atmospheric Research

 

Frequently, the goal of a forecast verification study is to compare the quality of different forecasts or algorithms. Point estimates of forecast quality are inadequate for determining if one type of forecast or algorithm performs significantly better than another or whether forecasts have been improved. Confidence intervals provide an effective way to make these comparisons. Additionally, confidence intervals explicitly communicate the uncertainty associated with a measure of forecast quality in a way that point estimates cannot. However, these methods have been infrequently applied in forecast verification studies, due to the characteristics of the verification data and measures.

 

Classical statistical models, conditional models, and computer resampling methods can all be used to estimate confidence intervals for verification statistics. The assumptions, performance, and relative merits of each type of interval are presented.