Revisiting Classifier Two-Sample Tests | TransferLab

Reference

Revisiting Classifier Two-Sample Tests, David Lopez-Paz, Maxime Oquab. (2022)

Abstract

The goal of two-sample tests is to assess whether two samples, $S_P \sim P^n$ and $S_Q \sim Q^m$, are drawn from the same distribution. Perhaps intriguingly, one relatively unexplored method to build two-sample tests is the use of binary classifiers. In particular, construct a dataset by pairing the $n$ examples in $S_P$ with a positive label, and by pairing the $m$ examples in $S_Q$ with a negative label. If the null hypothesis ``$P = Q$'' is true, then the classification accuracy of a binary classifier on a held-out subset of this dataset should remain near chance-level. As we will show, such \emph{Classifier Two-Sample Tests} (C2ST) learn a suitable representation of the data on the fly, return test statistics in interpretable units, have a simple null distribution, and their predictive uncertainty allow to interpret where $P$ and $Q$ differ. The goal of this paper is to establish the properties, performance, and uses of C2ST. First, we analyze their main theoretical properties. Second, we compare their performance against a variety of state-of-the-art alternatives. Third, we propose their use to evaluate the sample quality of generative models with intractable likelihoods, such as Generative Adversarial Networks (GANs). Fourth, we showcase the novel application of GANs together with C2ST for causal discovery.

Content citing this item

Software

sbi: the simulation-based inference toolkit

sbi is a Python package for Bayesian parameter inference on simulators. It implements state-of-the-art algorithms and comes with …

Simulation-Based Inference

Apr 15, 2024

Pill

A Trust Crisis In Simulation-Based Inference? Your Posterior Approximations Can Be Unfaithful

In posterior estimation, it is generally worse to exclude plausible parameters than wrongly including implausible ones. However, extensive …

Simulation-Based Inference

Feb 24, 2023

All works referenced in our site...