Show simple item record

dc.contributor.authorKollar, Thomas Fleming
dc.contributor.authorTellex, Stefanie A.
dc.contributor.authorRoy, Nicholas
dc.date.accessioned2013-10-18T17:54:26Z
dc.date.available2013-10-18T17:54:26Z
dc.date.issued2010
dc.identifier.isbn9781577354871
dc.identifier.isbn1577354877
dc.identifier.otherTechnical report ; FS-10-05
dc.identifier.urihttp://hdl.handle.net/1721.1/81437
dc.description.abstractTo be useful teammates to human partners, robots must be able to follow spoken instructions given in natural language. However, determining the correct sequence of actions in response to a set of spoken instructions is a complex decision-making problem. There is a "semantic gap" between the high-level symbolic models of the world that people use, and the low-level models of geometry, state dynamics, and perceptions that robots use. In this paper, we show how this gap can be bridged by inferring the best sequence of actions from a linguistic description and environmental features. This work improves upon previous work in three ways. First, by using a conditional random field (CRF), we learn the relative weight of environmental and linguistic features, enabling the system to learn the meanings of words and reducing the modeling effort in learning how to follow commands. Second, a number of long-range features are added, which help the system to use additional structure in the problem. Finally, given a natural language command, we infer both the referred path and landmark directly, thereby requiring the algorithm to pick a landmark by which it should navigate. The CRF is demonstrated to have 15% error on a held-out dataset, when compared with 39% error for a Markov random field (MRF). Finally, by analyzing the additional annotations necessary for this work, we find that natural language route directions map sequentially onto the corresponding path and landmarks 99.6% of the time. In addition, the size of the referred landmark varies from 0m[superscript 2] to 1964m[superscript 2] and the length of the referred path varies from 0m to 40.83m.en_US
dc.description.sponsorshipUnited States. Office of Naval Research (MURIs N00014-07-1-0749)en_US
dc.language.isoen_US
dc.publisherAmerican Association for Artificial Intelligenceen_US
dc.relation.isversionofhttp://www.aaai.org/ocs/index.php/FSS/FSS10/paper/view/2195/2741en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceMIT web domainen_US
dc.titleA discriminative model for understanding natural language route directionsen_US
dc.typeArticleen_US
dc.identifier.citationKollar, Thomas, Stefanie Tellex, and Nicholas Roy. "A Discriminative Model for Understanding Natural Language Route Directions." Dialog with robots: papers from the AAAI Fall Symposium, 2010.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronauticsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Media Laboratoryen_US
dc.contributor.mitauthorKollar, Thomas Flemingen_US
dc.contributor.mitauthorTellex, Stefanie A.en_US
dc.contributor.mitauthorRoy, Nicholasen_US
dc.relation.journalDialog with robots: papers from the AAAI Fall Symposium, 2010en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsKollar, Thomas; Tellex, Stefanie A.; Roy, Nicholasen_US
dc.identifier.orcidhttps://orcid.org/0000-0002-8293-0492
dspace.mitauthor.errortrue
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record