Show simple item record

dc.contributor.authorPeng, Andi
dc.contributor.authorBobu, Andreea
dc.contributor.authorLi, Belinda Z.
dc.contributor.authorSumers, Theodore R.
dc.contributor.authorSucholutsky, Ilia
dc.contributor.authorKumar, Nishanth
dc.contributor.authorGriffiths, Thomas L.
dc.contributor.authorShah, Julie A.
dc.date.accessioned2024-04-03T18:12:14Z
dc.date.available2024-04-03T18:12:14Z
dc.date.issued2024-03-11
dc.identifier.isbn979-8-4007-0322-5
dc.identifier.urihttps://hdl.handle.net/1721.1/154050
dc.descriptionHRI ’24, March 11–14, 2024, Boulder, CO, USAen_US
dc.description.abstractLearning from demonstrations is a common way for users to teach robots, but it is prone to spurious feature correlations. Recent work constructs state abstractions, i.e. visual representations containing task-relevant features, from language as a way to perform more generalizable learning. However, these abstractions also depend on a user's preference for what matters in a task, which may be hard to describe or infeasible to exhaustively specify using language alone. How do we construct abstractions to capture these latent preferences? We observe that how humans behave reveals how they see the world. Our key insight is that changes in human behavior inform us that there are differences in preferences for how humans see the world, i.e. their state abstractions. In this work, we propose using language models (LMs) to query for those preferences directly given knowledge that a change in behavior has occurred. In our framework, we use the LM in two ways: first, given a text description of the task and knowledge of behavioral change between states, we query the LM for possible hidden preferences; second, given the most likely preference, we query the LM to construct the state abstraction. In this framework, the LM is also able to ask the human directly when uncertain about its own estimate. We demonstrate our framework's ability to construct effective preference-conditioned abstractions in simulated experiments, a user study, as well as on a real Spot robot performing mobile manipulation tasks.en_US
dc.publisherACMen_US
dc.relation.isversionof10.1145/3610977.3634930en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceACMen_US
dc.titlePreference-Conditioned Language-Guided Abstractionen_US
dc.typeArticleen_US
dc.identifier.citationPeng, Andi, Bobu, Andreea, Li, Belinda Z., Sumers, Theodore R., Sucholutsky, Ilia et al. 2024. "Preference-Conditioned Language-Guided Abstraction."
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronautics
dc.identifier.mitlicensePUBLISHER_CC
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2024-04-01T07:45:33Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2024-04-01T07:45:33Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record