Crossmodal attentive skill learner

Omidshafiei, Shayegan; Kim, Dong-Ki; Pazis, Jason; How, Jonathan P.

dc.contributor.author	Omidshafiei, Shayegan
dc.contributor.author	Kim, Dong-Ki
dc.contributor.author	Pazis, Jason
dc.contributor.author	How, Jonathan P.
dc.date.accessioned	2021-11-08T18:06:12Z
dc.date.available	2021-11-08T18:06:12Z
dc.date.issued	2018
dc.identifier.uri	https://hdl.handle.net/1721.1/137749
dc.description.abstract	© 2018 International Foundation for Autonomous Agents and Multiagent Systems. This paper introduces the Crossmodal Attentive Skill Learner (CASL), Integrated with the recently-introduced Asynchronous Advantage Option-Critic (A2OC) architecture (15] to enable hierarchical rei nforcement learning across multiple sensory inputs. We provide concrete examples where the approach not only improves perform ance in a single task, but accelerates transfer to new tasks. We demonstrate the attention mechanism anticipates and identifies useful latent features, while filtering irrelevant sensor modalities during execution. We modify the Arcade Learning Environment (7] to support audio queries, and conduct evaluations of crossmodal learning in the Atari 2600 games H.E.R.O. and Amidar. Finally, buildi ng on the recent work of Babaeizadeh et aL [4], we open-soulce a fast hybrid CPU-CPU implementation of CASL.	en_US
dc.language.iso	en
dc.relation.isversionof	http://ifaamas.org/Proceedings/aamas2018/pdfs/p139.pdf	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	Crossmodal attentive skill learner	en_US
dc.type	Article	en_US
dc.identifier.citation	Omidshafiei, Shayegan, Kim, Dong-Ki, Pazis, Jason and How, Jonathan P. 2018. "Crossmodal attentive skill learner."
dc.contributor.department	Massachusetts Institute of Technology. Aerospace Controls Laboratory
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2019-10-28T15:14:27Z
dspace.date.submission	2019-10-28T15:14:39Z
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 1711.10314.pdf
Size:: 2.547Mb
Format:: PDF
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record