Non-Linear Monte-Carlo Search in Civilization II

Branavan, Satchuthanan R.; Silver, David; Barzilay, Regina

dc.contributor.author	Branavan, Satchuthanan R.
dc.contributor.author	Silver, David
dc.contributor.author	Barzilay, Regina
dc.date.accessioned	2012-10-24T20:34:34Z
dc.date.available	2012-10-24T20:34:34Z
dc.date.issued	2011-07
dc.identifier.isbn	978-1-57735-512-0
dc.identifier.isbn	978-1-57735-516-8
dc.identifier.uri	http://hdl.handle.net/1721.1/74248
dc.description.abstract	This paper presents a new Monte-Carlo search algorithm for very large sequential decision-making problems. Our approach builds on the recent success of Monte-Carlo tree search algorithms, which estimate the value of states and actions from the mean outcome of random simulations. Instead of using a search tree, we apply non-linear regression, online, to estimate a state-action value function from the outcomes of random simulations. This value function generalizes between related states and actions, and can therefore provide more accurate evaluations after fewer simulations. We apply our Monte-Carlo search algorithm to the game of Civilization II, a challenging multi-agent strategy game with an enormous state space and around $10^{21}$ joint actions. We approximate the value function by a neural network, augmented by linguistic knowledge that is extracted automatically from the official game manual. We show that this non-linear value function is significantly more efficient than a linear value function. Our non-linear Monte-Carlo search wins 80\% of games against the handcrafted, built-in AI for Civilization II.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (CAREER grant IIS-0448168)	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (grant IIS-0835652)	en_US
dc.description.sponsorship	United States. Defense Advanced Research Projects Agency (DARPA Machine Reading Program (FA8750-09-C-0172))	en_US
dc.description.sponsorship	Microsoft Research (New Faculty Fellowship)	en_US
dc.language.iso	en_US
dc.publisher	AAAI Press/International Joint Conferences on Artificial Intelligence	en_US
dc.relation.isversionof	http://ijcai-11.iiia.csic.es/program/paper/1252	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	MIT web domain	en_US
dc.title	Non-Linear Monte-Carlo Search in Civilization II	en_US
dc.type	Article	en_US
dc.identifier.citation	Branavan, S. R. K. David Silver, and Regina Barzilay. "Non-Linear Monte-Carlo Search in Civilization II." in Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, Barcelona, Catalonia, Spain, 16–22 July 2011. p.2404.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.approver	Barzilay, Regina
dc.contributor.mitauthor	Branavan, Satchuthanan R.
dc.contributor.mitauthor	Barzilay, Regina
dc.relation.journal	Proceedings of the Twenty-second International Joint Conference on Artificial Intelligence	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
dspace.orderedauthors	Branavan, S.R.K; Silver, David; Barzilay, Regina	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-2921-8201
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Barzilay-Non-Linear Monte.pdf
Size:: 294.8Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record