Skip to main navigation Skip to search Skip to main content

Smooth p-Wasserstein Distance: Structure, Empirical Approximation, and Statistical Applications

  • Sloan Nietert
  • , Ziv Goldfeld
  • , Kengo Kato

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

23 Scopus citations

Abstract

Discrepancy measures between probability distributions, often termed statistical distances, are ubiquitous in probability theory, statistics and machine learning. To combat the curse of dimensionality when estimating these distances from data, recent work has proposed smoothing out local irregularities in the measured distributions via convolution with a Gaussian kernel. Motivated by the scalability of this framework to high dimensions, we investigate the structural and statistical behavior of the Gaussian-smoothed p-Wasserstein distance Wp(σ), for arbitrary p ≥ 1. After establishing basic metric and topological properties of Wp(σ), we explore the asymptotic statistical behavior of Wp(σ)(µn, µ), where µn is the empirical distribution of n independent observations from µ. We prove that Wp(σ) enjoys a parametric empirical convergence rate of n−1/2, which contrasts the n−1/d rate for unsmoothed Wp when d ≥ 3. Our proof relies on controlling Wp(σ) by a pth-order smooth Sobolev distance d(pσ) and deriving the limit distribution of √nd(pσ)(µn, µ), for all dimensions d. As applications, we provide asymptotic guarantees for two-sample testing and minimum distance estimation using Wp(σ), with experiments for p = 2 using a maximum mean discrepancy formulation of d(2σ).

Original languageEnglish
Title of host publicationProceedings of the 38th International Conference on Machine Learning, ICML 2021
PublisherML Research Press
Pages8172-8183
Number of pages12
ISBN (Electronic)9781713845065
StatePublished - 1 Jan 2021
Externally publishedYes
Event38th International Conference on Machine Learning, ICML 2021 - Virtual, Online
Duration: 18 Jul 202124 Jul 2021

Publication series

NameProceedings of Machine Learning Research
Volume139
ISSN (Electronic)2640-3498

Conference

Conference38th International Conference on Machine Learning, ICML 2021
CityVirtual, Online
Period18/07/2124/07/21

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Smooth p-Wasserstein Distance: Structure, Empirical Approximation, and Statistical Applications'. Together they form a unique fingerprint.

Cite this