The Rate of Convergence of AdaBoost

Mukherjee, Indraneel; Rudin, Cynthia; Schapire, Robert E.

dc.contributor.author	Mukherjee, Indraneel
dc.contributor.author	Rudin, Cynthia
dc.contributor.author	Schapire, Robert E.
dc.date.accessioned	2013-12-23T21:15:38Z
dc.date.available	2013-12-23T21:15:38Z
dc.date.issued	2013-08
dc.date.submitted	2013-05
dc.identifier.issn	1532-4435
dc.identifier.issn	1533-7928
dc.identifier.uri	http://hdl.handle.net/1721.1/83258
dc.description.abstract	The AdaBoost algorithm was designed to combine many “weak” hypotheses that perform slightly better than random guessing into a “strong” hypothesis that has very low error. We study the rate at which AdaBoost iteratively converges to the minimum of the “exponential loss”. Unlike previous work, our proofs do not require a weak-learning assumption, nor do they require that minimizers of the exponential loss are finite. Our first result shows that the exponential loss of AdaBoost's computed parameter vector will be at most ε more than that of any parameter vector of ℓ[subscript 1]-norm bounded by B in a number of rounds that is at most a polynomial in B and 1/ε. We also provide lower bounds showing that a polynomial dependence is necessary. Our second result is that within C/ε iterations, AdaBoost achieves a value of the exponential loss that is at most ε more than the best possible value, where C depends on the data set. We show that this dependence of the rate on ε is optimal up to constant factors, that is, at least Ω(1/ε) rounds are necessary to achieve within ε of the optimal exponential loss.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant IIS-1016029)	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant IIS-1053407)	en_US
dc.language.iso	en_US
dc.publisher	Association for Computing Machinery (ACM)	en_US
dc.relation.isversionof	http://jmlr.org/papers/v14/mukherjee13b.html	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Journal of Machine Learning Research	en_US
dc.title	The Rate of Convergence of AdaBoost	en_US
dc.type	Article	en_US
dc.identifier.citation	Mukherjee, Indraneel, Cynthia Rudin, and Robert E. Schapire. “The Rate of Convergence of AdaBoost.” Journal of Machine Learning Research 14 (2013): 2315–2347.	en_US
dc.contributor.department	Sloan School of Management	en_US
dc.contributor.mitauthor	Rudin, Cynthia	en_US
dc.relation.journal	Journal of Machine Learning Research	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Mukherjee, Indraneel; Rudin, Cynthia; Schapire, Robert E.	en_US
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Mukherjee-2013-Rate of converg ...
Size:: 299.0Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record