
dc.contributor.author: Lu, Haihao
dc.contributor.author: Freund, Robert M
dc.date.accessioned: 2021-11-01T14:33:17Z
dc.date.available: 2021-11-01T14:33:17Z
dc.date.issued: 2020-03-04
dc.identifier.uri: https://hdl.handle.net/1721.1/136776
dc.description.abstract: The stochastic Frank–Wolfe method has recently attracted much general interest in the context of optimization for statistical and machine learning due to its ability to work with a more general feasible region. However, there has been a complexity gap in the dependence on the optimality tolerance $\varepsilon$ in the guaranteed convergence rate for stochastic Frank–Wolfe compared to its deterministic counterpart. In this work, we present a new generalized stochastic Frank–Wolfe method which closes this gap for the class of structured optimization problems encountered in statistical and machine learning, characterized by empirical loss minimization with a certain type of “linear prediction” property (formally defined in the paper), which is typically present in loss minimization problems in practice. Our method also introduces the notion of a “substitute gradient” that is a not-necessarily-unbiased estimate of the gradient. We show that our new method is equivalent to a particular randomized coordinate mirror descent algorithm applied to the dual problem, which in turn provides a new interpretation of randomized dual coordinate descent in the primal space. Also, in the special case of a strongly convex regularizer, our generalized stochastic Frank–Wolfe method (as well as the randomized dual coordinate descent method) exhibits linear convergence. Furthermore, we present computational experiments indicating that our method outperforms other stochastic Frank–Wolfe methods for a sufficiently small optimality tolerance, consistent with the theory developed herein.
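For context on the baseline the abstract compares against, the following is a minimal sketch of the classic (deterministic) Frank–Wolfe iteration on the probability simplex, not the paper's generalized stochastic variant with substitute gradients. The quadratic objective, problem sizes, and step-size rule are illustrative assumptions.

```python
import numpy as np

# Classic Frank-Wolfe on the probability simplex, minimizing
# f(x) = 0.5 * ||A x - b||^2. Illustrative sketch only; the
# paper's method replaces the exact gradient with a stochastic
# "substitute gradient".

rng = np.random.default_rng(0)
n, d = 50, 20
A = rng.standard_normal((n, d))
b = rng.standard_normal(n)

def grad(x):
    # Gradient of 0.5 * ||A x - b||^2
    return A.T @ (A @ x - b)

x = np.ones(d) / d  # start at the barycenter of the simplex
for t in range(200):
    g = grad(x)
    # Linear minimization oracle: over the simplex, <g, s> is
    # minimized at the vertex of the smallest gradient entry.
    s = np.zeros(d)
    s[np.argmin(g)] = 1.0
    gamma = 2.0 / (t + 2.0)  # standard open-loop step size
    x = (1 - gamma) * x + gamma * s  # stays in the simplex

print("final objective:", 0.5 * np.linalg.norm(A @ x - b) ** 2)
```

Because each iterate is a convex combination of simplex vertices, the method is projection-free; this is the property that makes Frank–Wolfe attractive for the general feasible regions mentioned in the abstract.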
dc.publisher: Springer Berlin Heidelberg
dc.relation.isversionof: https://doi.org/10.1007/s10107-020-01480-7
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.source: Springer Berlin Heidelberg
dc.title: Generalized stochastic Frank–Wolfe algorithm with stochastic “substitute” gradient for structured convex optimization
dc.type: Article
dc.contributor.department: Massachusetts Institute of Technology. Department of Mathematics
dc.contributor.department: Sloan School of Management
dc.eprint.version: Author's final manuscript
dc.type.uri: http://purl.org/eprint/type/JournalArticle
eprint.status: http://purl.org/eprint/status/PeerReviewed
dc.date.updated: 2021-04-21T03:31:56Z
dc.language.rfc3066: en
dc.rights.holder: Springer-Verlag GmbH Germany, part of Springer Nature and Mathematical Optimization Society
dspace.embargo.terms: Y
dspace.date.submission: 2021-04-21T03:31:55Z
mit.license: OPEN_ACCESS_POLICY
mit.metadata.status: Authority Work and Publication Information Needed

