Optimal decision-making based on classifiers requires that the confidence in their predictions reflect the actual error rates. When this is the case, the model is said to be calibrated. Recent work has shown that expressive neural networks can overfit the cross-entropy loss without losing accuracy, producing overconfident (i.e., miscalibrated) models.
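A common way to make this precise: a classifier that predicts class ŷ with confidence p̂ is calibrated if P(Y = ŷ | p̂ = p) = p for every confidence level p, so that, for instance, predictions issued with confidence 0.8 are correct 80% of the time.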
Learning outcomes
- Understanding quantitatively how optimal decisions depend on the full vector of predicted probabilities, not just on the predicted class.
- Learning to measure (mis)calibration of models.
- Recalibrating classifiers during training: loss functions, regularisation.
- Recalibrating classifiers a posteriori: non-parametric, parametric, and Bayesian methods.
Structure of the workshop
- Introduction to the nomenclature: notions of calibration, metrics, calibration functions
- Relation between calibration and accuracy
- Discussion of situations when calibration is important for performance
- Measuring miscalibration 1: histogram estimators (a sketch of the binned ECE estimator follows this list)
- Measuring miscalibration 2: statistical tests and decision-process simulation
- Recalibration by post-processing (a temperature-scaling sketch also follows this list)
- Training calibrated classifiers
- Performance comparison of different approaches to calibration
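The histogram estimator mentioned above is typically the Expected Calibration Error (ECE): predictions are grouped into confidence bins, and the gaps between average confidence and empirical accuracy are averaged across bins, weighted by bin size. A minimal sketch in Python with NumPy (function and argument names are illustrative, not taken from any workshop codebase):

```python
import numpy as np

def expected_calibration_error(confidences, predictions, labels, n_bins=15):
    """Binned ECE: weighted average of |accuracy - confidence| per bin."""
    confidences = np.asarray(confidences, dtype=float)
    correct = (np.asarray(predictions) == np.asarray(labels)).astype(float)

    bin_edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bin_edges[:-1], bin_edges[1:]):
        # Assign each sample to the bin containing its top-class confidence.
        in_bin = (confidences > lo) & (confidences <= hi)
        if not in_bin.any():
            continue
        bin_weight = in_bin.mean()             # fraction of samples in this bin
        bin_acc = correct[in_bin].mean()       # empirical accuracy in the bin
        bin_conf = confidences[in_bin].mean()  # average confidence in the bin
        ece += bin_weight * abs(bin_acc - bin_conf)
    return ece
```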
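For the post-processing step, one widely used parametric method is temperature scaling: a single scalar T > 0 is fitted on held-out logits so that softmax(logits / T) minimises the negative log-likelihood, rescaling confidences without changing the predicted class. A rough sketch, assuming NumPy and SciPy are available (names again illustrative):

```python
import numpy as np
from scipy.optimize import minimize_scalar

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def fit_temperature(logits, labels):
    """Return the temperature T minimising the NLL of softmax(logits / T)."""
    logits = np.asarray(logits, dtype=float)
    labels = np.asarray(labels)

    def nll(t):
        probs = softmax(logits / t)
        return -np.mean(np.log(probs[np.arange(len(labels)), labels] + 1e-12))

    result = minimize_scalar(nll, bounds=(0.05, 10.0), method="bounded")
    return result.x

# Usage: calibrated = softmax(np.asarray(test_logits) / fit_temperature(val_logits, val_labels))
```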