Learning hierarchical teaching policies for cooperative agents

How, Jonathan P.

dc.contributor.author	How, Jonathan P.
dc.date.accessioned	2021-11-02T18:45:14Z
dc.date.available	2021-11-02T18:45:14Z
dc.date.issued	2020
dc.identifier.uri	https://hdl.handle.net/1721.1/137164
dc.description.abstract	© 2020 International Foundation for Autonomous. Collective learning can be greatly enhanced when agents effectively exchange knowledge with their peers. In particular, recent work studying agents that learn to teach other teammates has demonstrated that action advising accelerates team-wide learning. However, the prior work has simplified the learning of advising policies by using simple function approximations and only considered advising with primitive (low-level) actions, limiting the scalability of learning and teaching to complex domains. This paper introduces a novel learning-to-teach framework, called hierarchical multiagent teaching (HMAT), that improves scalability to complex environments by using the deep representation for student policies and by advising with more expressive extended action sequences over multiple levels of temporal abstraction. Our empirical evaluations demonstrate that HMAT improves team-wide learning progress in large, complex domains where previous approaches fail. HMAT also learns teaching policies that can effectively transfer knowledge to different teammates with knowledge of different tasks, even when the teammates have heterogeneous action spaces.	en_US
dc.language.iso	en
dc.relation.isversionof	https://dl.acm.org/doi/10.5555/3398761.3398836	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	Learning hierarchical teaching policies for cooperative agents	en_US
dc.type	Article	en_US
dc.identifier.citation	How, Jonathan P. 2020. "Learning hierarchical teaching policies for cooperative agents." Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS, 2020-May.
dc.contributor.department	MIT-IBM Watson AI Lab
dc.contributor.department	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
dc.relation.journal	Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2021-04-30T14:19:30Z
dspace.orderedauthors	Kim, DK; Liu, M; Omidshafiei, S; Lopez-Cot, S; Riemer, M; Habibi, G; Tesauro, G; Mourad, S; Campbell, M; How, JP	en_US
dspace.date.submission	2021-04-30T14:19:31Z
mit.journal.volume	2020-May	en_US
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 1903.03216.pdf
Size:: 1.107Mb
Format:: PDF
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record