Learning efficiently with approximate inference via dual losses
Author(s): Meshi, Ofer; Sontag, David Alexander; Jaakkola, Tommi S.; Globerson, Amir
Abstract: Many structured prediction tasks involve complex models where inference is computationally intractable, but where it can be well approximated using a linear programming relaxation. Previous approaches to learning structured predictors (e.g., cutting-plane, subgradient methods, perceptron) repeatedly make predictions for some of the data points. These approaches are computationally demanding because each prediction involves solving a linear program to optimality. We present a scalable algorithm for learning structured predictors. The main idea is to instead solve the dual of the structured prediction loss. We formulate the learning task as a convex minimization over both the weights and the dual variables corresponding to each data point. As a result, we can begin to optimize the weights even before completely solving any of the individual prediction problems. We show how the dual variables can be efficiently optimized using coordinate descent. Our algorithm is competitive with state-of-the-art methods such as stochastic subgradient and cutting-plane.
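The abstract describes interleaving block-coordinate descent on each example's dual variables with (sub)gradient updates on the weights, rather than solving each loss-augmented inference LP to optimality. Below is a minimal, hypothetical sketch of that idea on a toy two-binary-node, one-edge model, where the dual of the loss-augmented max is the standard dual-decomposition bound and the block-coordinate updates have a closed form. All names, the toy data, the step sizes, and the specific update form are invented for illustration; this is not the authors' code.

```python
# Hedged sketch (not the paper's implementation): interleave coordinate descent
# on per-example dual variables with subgradient steps on the weights, instead
# of solving each loss-augmented inference problem (an LP) to optimality.
# Toy model: two binary nodes joined by one edge; all details are illustrative.
import numpy as np

rng = np.random.default_rng(0)
d, n_states = 3, 2

def make_example():
    y = rng.integers(0, n_states, size=2)          # true labels for the 2 nodes
    x = rng.normal(size=(2, d))
    x[np.arange(2), y] += 1.5                      # features correlate with labels
    return x, y

data = [make_example() for _ in range(30)]
w_un = np.zeros((n_states, d))                     # unary weights (one row per state)
w_ed = np.zeros((n_states, n_states))              # edge weights (one scalar per pair)
deltas = [np.zeros((2, n_states)) for _ in data]   # per-example dual variables

def lossaug_potentials(x, y):
    theta = x @ w_un.T                             # theta[node, state]
    theta += (np.arange(n_states)[None, :] != y[:, None])  # Hamming loss term
    return theta

def coord_pass(theta, delta):
    """One closed-form block-coordinate pass on this example's dual variables."""
    g0 = (w_ed - delta[1][None, :]).max(axis=1)    # reparametrized edge max, node 0
    delta[0] = 0.5 * (g0 - theta[0])
    g1 = (w_ed - delta[0][:, None]).max(axis=0)    # node 1, using the fresh delta[0]
    delta[1] = 0.5 * (g1 - theta[1])
    return delta

def dual_bound_and_grad(x, theta, delta):
    """Dual-decomposition upper bound on the loss-augmented max + a subgradient in w."""
    s = [int(np.argmax(theta[n] + delta[n])) for n in range(2)]
    edge = w_ed - delta[0][:, None] - delta[1][None, :]
    t = np.unravel_index(np.argmax(edge), edge.shape)
    bound = (theta[0] + delta[0]).max() + (theta[1] + delta[1]).max() + edge.max()
    g_un, g_ed = np.zeros_like(w_un), np.zeros_like(w_ed)
    for n in range(2):
        g_un[s[n]] += x[n]
    g_ed[t] += 1.0
    return bound, g_un, g_ed

eta, lam = 0.05, 0.01
for epoch in range(50):
    for i, (x, y) in enumerate(data):
        theta = lossaug_potentials(x, y)
        deltas[i] = coord_pass(theta, deltas[i])   # partial dual optimization only
        bound, g_un, g_ed = dual_bound_and_grad(x, theta, deltas[i])
        g_un[y[0]] -= x[0]; g_un[y[1]] -= x[1]     # subtract the true feature map
        g_ed[y[0], y[1]] -= 1.0
        w_un -= eta * (g_un + lam * w_un)
        w_ed -= eta * (g_ed + lam * w_ed)

# Sanity quantities: the dual bound upper-bounds the exact loss-augmented max
# for any delta (weak duality), and the learned weights fit the toy data.
def exact_lossaug_max(theta):
    return max(theta[0, a] + theta[1, b] + w_ed[a, b]
               for a in range(n_states) for b in range(n_states))

x0, y0 = data[0]
theta0 = lossaug_potentials(x0, y0)
bound0, _, _ = dual_bound_and_grad(x0, theta0, deltas[0])
gap = bound0 - exact_lossaug_max(theta0)           # must be >= 0 by duality

def predict(x):
    th = x @ w_un.T                                # no loss augmentation at test time
    return max(((a, b) for a in range(n_states) for b in range(n_states)),
               key=lambda ab: th[0, ab[0]] + th[1, ab[1]] + w_ed[ab])

acc = np.mean([predict(x)[n] == y[n] for x, y in data for n in range(2)])
```

The point illustrated is that `deltas[i]` receives only one coordinate pass per visit and is warm-started across epochs, yet the resulting dual bound is always a valid upper bound on the structured hinge loss, so the weight update can proceed before any example's inference problem is solved to optimality.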
Department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
International Conference on Machine Learning (27th, 2010) proceedings
International Machine Learning Society
Citation: Meshi, Ofer et al. "Learning Efficiently with Approximate Inference via Dual Losses." Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 2010.
Final published version