Aleatoric and epistemic uncertainty in machine learning

02 Nov 22 00:00 UTC

Uncertainty quantification (UQ) in machine learning is the practice of measuring or estimating uncertainty in models. It is a set of tools to understand the limitations of models and predictions, and to make better decisions. UQ is a key …

Left: Even with precise knowledge about the optimal hypothesis, the prediction at the query point (indicated by a question mark) is aleatorically uncertain, because the two classes are overlapping in that region. Right: A case of epistemic uncertainty due to a lack of knowledge about the right hypothesis, which is in turn caused by a lack of data.

Michael Panchenko (appliedAI Initiative)

Mischa is a researcher with background in physics and mathematics who decided to change course and go into AI (for the sake of falsifiability of ideas). On his path since then he has worked on multiple projects in ML and data analysis and as a bonus gained some experience DevOps and in developing production grade solutions.

This talk introduces the notions of aleatoric and epistemic uncertainty for probabilistic models and gives an overview of recent developments in this area. It is mainly based on the corresponding review paper.

Aleatoric uncertainty arises from inherent randomness or variability in the data, such as measurement errors or natural variations, and cannot be easily reduced. Epistemic uncertainty, on the other hand, stems from lack of knowledge or information about the data or model, such as model misspecification or insufficient data, and can often be decreased by including more data or using better models. Unfortunately, in real-world applications, it is often difficult to distinguish between these two types of uncertainty.

Several methods for estimating and quantifying uncertainty are discussed, including calibration, likelihood-based methods, and conformal prediction. Less well-known methods, rooted outside of probability theory, are also introduced and their advantages and disadvantages are discussed.

References

[Hul21A]

Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods, Eyke Hüllermeier, Willem Waegeman.

Mar 2021

The notion of uncertainty is of major importance in machine learning and constitutes a key element of machine learning methodology. In line with the statistical tradition, uncertainty has long been perceived as almost synonymous with standard probability and probabilistic predictions. Yet, due to the steadily increasing relevance of machine learning for practical applications and related issues …

Publication

References

In this series →