Online decision problems with large strategy sets

Kleinberg, Robert David

dc.contributor.advisor	F. Thomson Leighton.	en_US
dc.contributor.author	Kleinberg, Robert David	en_US
dc.contributor.other	Massachusetts Institute of Technology. Dept. of Mathematics.	en_US
dc.date.accessioned	2006-06-19T17:39:44Z
dc.date.available	2006-06-19T17:39:44Z
dc.date.copyright	2005	en_US
dc.date.issued	2005	en_US
dc.identifier.uri	http://hdl.handle.net/1721.1/33092
dc.description	Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Mathematics, 2005.	en_US
dc.description	Includes bibliographical references (p. 165-171).	en_US
dc.description.abstract	In an online decision problem, an algorithm performs a sequence of trials, each of which involves selecting one element from a fixed set of alternatives (the "strategy set") whose costs vary over time. After T trials, the combined cost of the algorithm's choices is compared with that of the single strategy whose combined cost is minimum. Their difference is called regret, and one seeks algorithms which are efficient in that their regret is sublinear in T and polynomial in the problem size. We study an important class of online decision problems called generalized multi- armed bandit problems. In the past such problems have found applications in areas as diverse as statistics, computer science, economic theory, and medical decision-making. Most existing algorithms were efficient only in the case of a small (i.e. polynomial- sized) strategy set. We extend the theory by supplying non-trivial algorithms and lower bounds for cases in which the strategy set is much larger (exponential or infinite) and the cost function class is structured, e.g. by constraining the cost functions to be linear or convex. As applications, we consider adaptive routing in networks, adaptive pricing in electronic markets, and collaborative decision-making by untrusting peers in a dynamic environment.	en_US
dc.description.statementofresponsibility	by Robert David Kleinberg.	en_US
dc.format.extent	171 p.	en_US
dc.format.extent	10061360 bytes
dc.format.extent	10071115 bytes
dc.format.mimetype	application/pdf
dc.format.mimetype	application/pdf
dc.language.iso	eng	en_US
dc.publisher	Massachusetts Institute of Technology	en_US
dc.rights	M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.	en_US
dc.rights.uri	http://dspace.mit.edu/handle/1721.1/7582
dc.subject	Mathematics.	en_US
dc.title	Online decision problems with large strategy sets	en_US
dc.type	Thesis	en_US
dc.description.degree	Ph.D.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Mathematics
dc.identifier.oclc	62173704	en_US

Files in this item

Name:: 62173704-MIT.pdf
Size:: 9.604Mb
Format:: PDF
Description:: Full printable version

View/Open

This item appears in the following Collection(s)

Doctoral Theses

Show simple item record