Show simple item record

dc.contributor.authorKollar, Thomas Fleming
dc.contributor.authorTellex, Stefanie A.
dc.contributor.authorRoy, Deb K.
dc.contributor.authorRoy, Nicholas
dc.date.accessioned2011-11-15T13:56:05Z
dc.date.available2011-11-15T13:56:05Z
dc.date.issued2010-03
dc.identifier.isbn978-1-4244-4893-7
dc.identifier.urihttp://hdl.handle.net/1721.1/67029
dc.description.abstractSpeaking using unconstrained natural language is an intuitive and flexible way for humans to interact with robots. Understanding this kind of linguistic input is challenging because diverse words and phrases must be mapped into structures that the robot can understand, and elements in those structures must be grounded in an uncertain environment. We present a system that follows natural language directions by extracting a sequence of spatial description clauses from the linguistic input and then infers the most probable path through the environment given only information about the environmental geometry and detected visible objects. We use a probabilistic graphical model that factors into three key components. The first component grounds landmark phrases such as "the computers" in the perceptual frame of the robot by exploiting co-occurrence statistics from a database of tagged images such as Flickr. Second, a spatial reasoning component judges how well spatial relations such as "past the computers" describe a path. Finally, verb phrases such as "turn right" are modeled according to the amount of change in orientation in the path. Our system follows 60% of the directions in our corpus to within 15 meters of the true destination, significantly outperforming other approaches.en_US
dc.description.sponsorshipUnited States. Office of Naval Research (MURI N00014-07-1-0749)en_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machineryen_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/1734454.1734553en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceMIT web domainen_US
dc.titleToward understanding natural language directionsen_US
dc.typeArticleen_US
dc.identifier.citationKollar, Thomas et al. “Toward Understanding Natural Language Directions.” Proceeding of the 5th ACM/IEEE International Conference on Human-robot Interaction - HRI ’10. Osaka, Japan, 2010. 259.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Media Laboratoryen_US
dc.contributor.approverRoy, Deb K.
dc.contributor.mitauthorKollar, Thomas Fleming
dc.contributor.mitauthorTellex, Stefanie A.
dc.contributor.mitauthorRoy, Deb K.
dc.contributor.mitauthorRoy, Nicholas
dc.relation.journalProceeding of the 5th ACM/IEEE international conference on Human-robot interactionen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsKollar, Thomas; Tellex, Stefanie; Roy, Deb; Roy, Nicholasen
dc.identifier.orcidhttps://orcid.org/0000-0002-4333-7194
dc.identifier.orcidhttps://orcid.org/0000-0002-8293-0492
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record