Statistical optimal transport via factored couplings
Author(s)
Forrow, A; Hütter, JC; Nitzan, M; Rigollet, P; Schiebinger, G; Weed, J; ... Show more Show less
DownloadPublished version (1.809Mb)
Publisher Policy
Publisher Policy
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Terms of use
Metadata
Show full item recordAbstract
© 2019 by the author(s). We propose a new method to estimate Wasserstein distances and optimal transport plans between two probability distributions from samples in high dimension. Unlike plug-in rules that simply replace the true distributions by their empirical counterparts, our method promotes couplings with low transport rank, a new structural assumption that is similar to the nonnegative rank of a matrix. Regularizing based on this assumption leads to drastic improvements on high-dimensional data for various tasks, including domain adaptation in single-cell RNA sequencing data. These findings are supported by a theoretical analysis that indicates that the transport rank is key in overcoming the curse of dimensionality inherent to data-driven optimal transport.
Date issued
2019Department
Massachusetts Institute of Technology. Department of Mathematics; Statistics and Data Science Center (Massachusetts Institute of Technology)Journal
AISTATS 2019 - 22nd International Conference on Artificial Intelligence and Statistics
Citation
Forrow, A, Hütter, JC, Nitzan, M, Rigollet, P, Schiebinger, G et al. 2019. "Statistical optimal transport via factored couplings." AISTATS 2019 - 22nd International Conference on Artificial Intelligence and Statistics, 89.
Version: Final published version