Crossmodal attentive skill learner

Omidshafiei, Shayegan; Kim, Dong-Ki; Pazis, Jason; How, Jonathan P.

Author(s)

Omidshafiei, Shayegan; Kim, Dong-Ki; Pazis, Jason; How, Jonathan P.

DownloadAccepted version (2.547Mb)

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Metadata

Show full item record

Abstract

© 2018 International Foundation for Autonomous Agents and Multiagent Systems. This paper introduces the Crossmodal Attentive Skill Learner (CASL), Integrated with the recently-introduced Asynchronous Advantage Option-Critic (A2OC) architecture (15] to enable hierarchical rei nforcement learning across multiple sensory inputs. We provide concrete examples where the approach not only improves perform ance in a single task, but accelerates transfer to new tasks. We demonstrate the attention mechanism anticipates and identifies useful latent features, while filtering irrelevant sensor modalities during execution. We modify the Arcade Learning Environment (7] to support audio queries, and conduct evaluations of crossmodal learning in the Atari 2600 games H.E.R.O. and Amidar. Finally, buildi ng on the recent work of Babaeizadeh et aL [4], we open-soulce a fast hybrid CPU-CPU implementation of CASL.

Date issued

2018

URI

https://hdl.handle.net/1721.1/137749

Department

Massachusetts Institute of Technology. Aerospace Controls Laboratory; Massachusetts Institute of Technology. Department of Aeronautics and Astronautics

Citation

Omidshafiei, Shayegan, Kim, Dong-Ki, Pazis, Jason and How, Jonathan P. 2018. "Crossmodal attentive skill learner."

Version: Author's final manuscript

Collections

MIT Open Access Articles