Understanding Black-box Predictions via Influence Functions

Reference

Understanding Black-box Predictions via Influence Functions, Pang Wei Koh, Percy Liang. Proceedings of the 34th International Conference on Machine Learning (2017)

Publication

Abstract

How can we explain the predictions of a black-box model? In this paper, we use influence functions — a classic technique from robust statistics — to trace a model’s prediction through the learning algorithm and back to its training data, thereby identifying training points most responsible for a given prediction. To scale up influence functions to modern machine learning settings, we develop a simple, efficient implementation that requires only oracle access to gradients and Hessian-vector products. We show that even on non-convex and non-differentiable models where the theory breaks down, approximations to influence functions can still provide valuable information. On linear models and convolutional neural networks, we demonstrate that influence functions are useful for multiple purposes: understanding model behavior, debugging models, detecting dataset errors, and even creating visually-indistinguishable training-set attacks.
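As a rough illustration of the idea in the abstract, the sketch below scores each training point by its influence on the loss at one test point, using the paper's quantity -∇L(z_test)ᵀ H⁻¹ ∇L(z), where H is the Hessian of the training objective. It is a minimal NumPy sketch for L2-regularized logistic regression, not the authors' implementation: the synthetic data, the function names, the damping value and the plain conjugate-gradient solver are all assumptions made for this example. The Hessian is never formed explicitly; only gradients and Hessian-vector products are used.

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def grad_loss(theta, x, y):
    """Gradient of the logistic loss at a single point (x, y), with y in {0, 1}."""
    return (sigmoid(x @ theta) - y) * x

def hvp(theta, X, v, damping):
    """Hessian-vector product of the mean logistic loss plus an L2 term.

    Computes H @ v without building H; `damping` plays the role of the
    L2 regularization strength (an assumption of this sketch).
    """
    p = sigmoid(X @ theta)
    return X.T @ (p * (1 - p) * (X @ v)) / len(X) + damping * v

def conjugate_gradient(matvec, b, iters=100, tol=1e-20):
    """Solve A x = b for symmetric positive definite A given only v -> A v."""
    x = np.zeros_like(b)
    r = b.copy()
    p = r.copy()
    rs = r @ r
    for _ in range(iters):
        if rs < tol:
            break
        Ap = matvec(p)
        alpha = rs / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

def influence_on_test_loss(theta, X, y, x_test, y_test, damping):
    """Return -grad L(z_test)^T H^{-1} grad L(z_i) for every training point z_i."""
    # One inverse-Hessian-vector product is shared by all training points.
    s_test = conjugate_gradient(lambda v: hvp(theta, X, v, damping),
                                grad_loss(theta, x_test, y_test))
    train_grads = (sigmoid(X @ theta) - y)[:, None] * X
    return -train_grads @ s_test

# Hypothetical synthetic data, for illustration only.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(50, 3))
y_train = (X_train @ np.array([1.0, -2.0, 0.5]) + rng.normal(size=50) > 0).astype(float)
x_test, y_test = rng.normal(size=3), 1.0
DAMPING = 0.01

# Fit the regularized model with plain gradient descent (the theory assumes
# theta is close to the minimizer of the training objective).
theta_hat = np.zeros(3)
for _ in range(2000):
    residual = sigmoid(X_train @ theta_hat) - y_train
    theta_hat -= 1.0 * (X_train.T @ residual / len(X_train) + DAMPING * theta_hat)

influences = influence_on_test_loss(theta_hat, X_train, y_train, x_test, y_test, DAMPING)
most_harmful = np.argsort(influences)[::-1][:5]  # upweighting these raises the test loss most
```

A positive score means that upweighting the training point would increase the test loss; a negative score means it would decrease it. For large models the exact conjugate-gradient solve would typically be replaced by a stochastic approximation, which this sketch does not attempt.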

Content citing this item

Seminar

Studying LLMs with Influence Functions

Fabio Peruzzo, Senior AI Engineer at appliedAI Initiative, will talk about a recent work on the use of influence functions to study the …

Data Valuation

Feb 29, 2024

Pill

DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

Accurately computing influence functions involves solving inverse Hessian problems, a challenging task as the parameter count increases, …

Data Valuation

Nov 27, 2023

Blog

Applications of data valuation in machine learning

At TransferLab we have extensively covered existing and developing methods for data valuation, the task of attributing value to samples in a …

Data Valuation

Nov 20, 2023

Pill

Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value

The out-of-bag (OOB) error estimate is a scalable approach to data valuation. Unlike marginal contribution methods, Data-OOB can leverage …

Data Valuation

Nov 17, 2023

Pill

Studying Large Language Model Generalization with Influence Functions

Influence functions are a tool to quantify the impact of each training sample on a model’s predictions, thereby assisting in the …

Data Valuation

Nov 3, 2023

Series

Data valuation

Attributions of value to training samples can be used to examine data, improve data acquisition, debug and improve models or compensate data …

Oct 24, 2023

Seminar

Influence functions and Data Pruning: from theory to non-convergence

Today’s session brings influence functions under the spotlight: the theory, non-convergence issues, and uses for data pruning. Fabio will …

Explainable AI

Jun 15, 2023

Software

pyDVL: the python Data Valuation Library

pyDVL strives to offer reference implementations of algorithms for data valuation and influence function computation, with an emphasis on …

Data Valuation

Oct 12, 2022

Pill

Resolving Training Biases via Influence-based Data Relabeling

Influence functions are used to correct corrupt labels in a dataset that would significantly decrease a trained model’s performance. …

Data Efficiency

May 3, 2022

Series

Explainable AI

Large opaque models like neural networks require dedicated methods to study and interpret their behavior. In this series we review recent …

