| dc.contributor.author | Omidshafiei, Shayegan | |
| dc.contributor.author | Kim, Dong-Ki | |
| dc.contributor.author | Pazis, Jason | |
| dc.contributor.author | How, Jonathan P. | |
| dc.date.accessioned | 2021-11-08T18:06:12Z | |
| dc.date.available | 2021-11-08T18:06:12Z | |
| dc.date.issued | 2018 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/137749 | |
| dc.description.abstract | © 2018 International Foundation for Autonomous Agents and Multiagent Systems. This paper introduces the Crossmodal Attentive Skill Learner (CASL), Integrated with the recently-introduced Asynchronous Advantage Option-Critic (A2OC) architecture (15] to enable hierarchical rei nforcement learning across multiple sensory inputs. We provide concrete examples where the approach not only improves perform ance in a single task, but accelerates transfer to new tasks. We demonstrate the attention mechanism anticipates and identifies useful latent features, while filtering irrelevant sensor modalities during execution. We modify the Arcade Learning Environment (7] to support audio queries, and conduct evaluations of crossmodal learning in the Atari 2600 games H.E.R.O. and Amidar. Finally, buildi ng on the recent work of Babaeizadeh et aL [4], we open-soulce a fast hybrid CPU-CPU implementation of CASL. | en_US |
| dc.language.iso | en | |
| dc.relation.isversionof | http://ifaamas.org/Proceedings/aamas2018/pdfs/p139.pdf | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | arXiv | en_US |
| dc.title | Crossmodal attentive skill learner | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Omidshafiei, Shayegan, Kim, Dong-Ki, Pazis, Jason and How, Jonathan P. 2018. "Crossmodal attentive skill learner." | |
| dc.contributor.department | Massachusetts Institute of Technology. Aerospace Controls Laboratory | |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2019-10-28T15:14:27Z | |
| dspace.date.submission | 2019-10-28T15:14:39Z | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |