Direct optimization through arg max for discrete variational auto-encoder

Gane, Andreea; Jaakkola, Tommi S

dc.contributor.author	Gane, Andreea
dc.contributor.author	Jaakkola, Tommi S
dc.date.accessioned	2021-01-19T15:13:42Z
dc.date.available	2021-01-19T15:13:42Z
dc.date.issued	2019-12
dc.date.submitted	2019-10
dc.identifier.issn	1049-5258
dc.identifier.uri	https://hdl.handle.net/1721.1/129438
dc.description.abstract	Reparameterization of variational auto-encoders with continuous random variables is an effective method for reducing the variance of their gradient estimates. In the discrete case, one can perform reparametrization using the Gumbel-Max trick, but the resulting objective relies on an arg max operation and is non-differentiable. In contrast to previous works which resort to softmax-based relaxations, we propose to optimize it directly by applying the direct loss minimization approach. Our proposal extends naturally to structured discrete latent variable models when evaluating the arg max operation is tractable. We demonstrate empirically the effectiveness of the direct loss minimization technique in variational autoencoders with both unstructured and structured discrete latent variables.	en_US
dc.language.iso	en
dc.publisher	Morgan Kaufmann Publishers	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Neural Information Processing Systems (NIPS)	en_US
dc.title	Direct optimization through arg max for discrete variational auto-encoder	en_US
dc.type	Article	en_US
dc.identifier.citation	Lorberbom, Guy et al. “Direct optimization through arg max for discrete variational auto-encoder.” 33rd Conference on Neural Information Processing Systems, December 2019, Vancouver, Canada, Morgan Kaufmann Publishers, 2019. © 2019 The Author(s)	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.relation.journal	33rd Conference on Neural Information Processing Systems (NeurIPS 2019)	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2020-12-21T15:49:12Z
dspace.orderedauthors	Lorberbom, G; Gane, A; Jaakkola, T; Hazan, T	en_US
dspace.date.submission	2020-12-21T15:49:15Z
mit.journal.volume	32	en_US
mit.license	PUBLISHER_POLICY
mit.metadata.status	Complete

Files in this item

Name:: NeurIPS-2019-direct-optimizati ...
Size:: 2.451Mb
Format:: PDF
Description:: Published version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record