Show simple item record

dc.contributor.authorPuig, Xavier
dc.contributor.authorRa, Kevin
dc.contributor.authorBoben, Marko
dc.contributor.authorLi, Jiaman
dc.contributor.authorWang, Tingwu
dc.contributor.authorFidler, Sanja
dc.contributor.authorTorralba, Antonio
dc.date.accessioned2020-04-30T19:36:26Z
dc.date.available2020-04-30T19:36:26Z
dc.date.issued2018-12
dc.date.submitted2018-06
dc.identifier.issn2575-7075
dc.identifier.issn1063-6919
dc.identifier.urihttps://hdl.handle.net/1721.1/124950
dc.description.abstractIn this paper, we are interested in modeling complex activities that occur in a typical household. We propose to use programs, i.e., sequences of atomic actions and interactions, as a high level representation of complex tasks. Programs are interesting because they provide a non-ambiguous representation of a task, and allow agents to execute them. However, nowadays, there is no database providing this type of information. Towards this goal, we first crowd-source programs for a variety of activities that happen in people's homes, via a game-like interface used for teaching kids how to code. Using the collected dataset, we show how we can learn to extract programs directly from natural language descriptions or from videos. We then implement the most common atomic (inter)actions in the Unity3D game engine, and use our programs to 'drive' an artificial agent to execute tasks in a simulated household environment. Our VirtualHome simulator allows us to create a large activity video dataset with rich ground-truth, enabling training and testing of video understanding models. We further showcase examples of our agent performing tasks in our VirtualHome based on language descriptions. © 2018 IEEE.en_US
dc.description.sponsorshipNSERC COHESA NETGP485577-15en_US
dc.description.sponsorshipIARPA D17PC00341en_US
dc.language.isoen
dc.publisherIEEEen_US
dc.relation.isversionof10.1109/CVPR.2018.00886en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleVirtualHome: Simulating Household Activities Via Programsen_US
dc.typeArticleen_US
dc.identifier.citationPuig, Xavier, et al. "VirtualHome: Simulating Household Activities Via Programs." IEEE/CVF Conference on Computer Vision and Pattern Recognition (June 2018): 18326092 © 2018 Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.relation.journal2018 IEEE/CVF Conference on Computer Vision and Pattern Recognitionen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-07-11T17:35:39Z
dspace.date.submission2019-07-11T17:35:41Z
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record