Show simple item record

dc.contributor.authorMeshi, Ofer
dc.contributor.authorSontag, David Alexander
dc.contributor.authorJaakkola, Tommi S.
dc.contributor.authorGloberson, Amir
dc.date.accessioned2011-05-19T21:39:04Z
dc.date.available2011-05-19T21:39:04Z
dc.date.issued2010-01
dc.identifier.isbn9781605589077
dc.identifier.isbn1605589071
dc.identifier.urihttp://hdl.handle.net/1721.1/62851
dc.description.abstractMany structured prediction tasks involve complex models where inference is computationally intractable, but where it can be well approximated using a linear programming relaxation. Previous approaches for learning for structured prediction (e.g., cutting- plane, subgradient methods, perceptron) repeatedly make predictions for some of the data points. These approaches are computationally demanding because each prediction involves solving a linear program to optimality. We present a scalable algorithm for learning for structured prediction. The main idea is to instead solve the dual of the structured prediction loss. We formulate the learning task as a convex minimization over both the weights and the dual variables corresponding to each data point. As a result, we can begin to optimize the weights even before completely solving any of the individual prediction problems. We show how the dual variables can be efficiently optimized using coordinate descent. Our algorithm is competitive with state-of-the-art methods such as stochastic subgradient and cutting-plane.en_US
dc.language.isoen_US
dc.publisherInternational Machine Learning Societyen_US
dc.relation.isversionofhttp://www.icml2010.org/papers/587.pdfen_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceMIT web domainen_US
dc.titleLearning efficiently with approximate inference via dual lossesen_US
dc.typeArticleen_US
dc.identifier.citationMeshi, Ofer et al. "Learning Efficiently with Approximate Inference via Dual Losses" Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 2010.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverJaakkola, Tommi S.
dc.contributor.mitauthorSontag, David Alexander
dc.contributor.mitauthorJaakkola, Tommi S.
dc.relation.journalInternational Conference on Machine Learning (27th, 2010) proceedingsen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsMeshi, Ofer; Sontag, David; Jaakkola, Tommi; Globerson, Amir
dc.identifier.orcidhttps://orcid.org/0000-0002-2199-0379
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record