Show simple item record

dc.contributor.authorJaakkola, Tommi
dc.contributor.authorGifford, David
dc.contributor.authorMueller, Jonas
dc.date.accessioned2021-11-08T12:44:49Z
dc.date.available2021-11-08T12:44:49Z
dc.date.issued2017
dc.identifier.urihttps://hdl.handle.net/1721.1/137633
dc.description.abstract© 2017 by the author(s). We present a model that, after learning on observations of (sequence, outcome) pairs, can be efficiently used to revise a new sequence in order to improve its associated outcome. Our framework requires neither example improvements, nor additional evaluation of outcomes for proposed revisions. To avoid combinatorial-search over sequence elements, we specify a generative model with continuous latent factors, which is learned via joint approximate inference using a recurrent variational autoencoder (VAE) and an outcome-predicting neural network module. Under this model, gradient methods can be used to efficiently optimize the continuous latent factors with respect to inferred outcomes. By appropriately constraining this optimization and using the VAE decoder to generate a revised sequence, we ensure the revision is fundamentally similar to the original sequence, is associated with better outcomes, and looks natural. These desiderata are proven to hold with high probability under our approach, which is empirically demonstrated for revising natural language sentences.en_US
dc.language.isoen
dc.relation.isversionofhttp://proceedings.mlr.press/v70/mueller17a.htmlen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleSequence to better sequence: Continuous revision of combinatorial structuresen_US
dc.typeArticleen_US
dc.identifier.citationJaakkola, Tommi, Gifford, David and Mueller, Jonas. 2017. "Sequence to better sequence: Continuous revision of combinatorial structures."
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-05-29T14:20:31Z
dspace.date.submission2019-05-29T14:20:32Z
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record