A discriminative model for understanding natural language route directions

Kollar, Thomas; Tellex, Stefanie A.; Roy, Nicholas

dc.contributor.author	Kollar, Thomas Fleming
dc.contributor.author	Tellex, Stefanie A.
dc.contributor.author	Roy, Nicholas
dc.date.accessioned	2013-10-18T17:54:26Z
dc.date.available	2013-10-18T17:54:26Z
dc.date.issued	2010
dc.identifier.isbn	9781577354871
dc.identifier.isbn	1577354877
dc.identifier.other	Technical report ; FS-10-05
dc.identifier.uri	http://hdl.handle.net/1721.1/81437
dc.description.abstract	To be useful teammates to human partners, robots must be able to follow spoken instructions given in natural language. However, determining the correct sequence of actions in response to a set of spoken instructions is a complex decision-making problem. There is a "semantic gap" between the high-level symbolic models of the world that people use, and the low-level models of geometry, state dynamics, and perceptions that robots use. In this paper, we show how this gap can be bridged by inferring the best sequence of actions from a linguistic description and environmental features. This work improves upon previous work in three ways. First, by using a conditional random field (CRF), we learn the relative weight of environmental and linguistic features, enabling the system to learn the meanings of words and reducing the modeling effort in learning how to follow commands. Second, a number of long-range features are added, which help the system to use additional structure in the problem. Finally, given a natural language command, we infer both the referred path and landmark directly, thereby requiring the algorithm to pick a landmark by which it should navigate. The CRF is demonstrated to have 15% error on a held-out dataset, when compared with 39% error for a Markov random field (MRF). Finally, by analyzing the additional annotations necessary for this work, we find that natural language route directions map sequentially onto the corresponding path and landmarks 99.6% of the time. In addition, the size of the referred landmark varies from 0m² to 1964m² and the length of the referred path varies from 0m to 40.83m.	en_US
dc.description.sponsorship	United States. Office of Naval Research (MURIs N00014-07-1-0749)	en_US
dc.language.iso	en_US
dc.publisher	American Association for Artificial Intelligence	en_US
dc.relation.isversionof	http://www.aaai.org/ocs/index.php/FSS/FSS10/paper/view/2195/2741	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	MIT web domain	en_US
dc.title	A discriminative model for understanding natural language route directions	en_US
dc.type	Article	en_US
dc.identifier.citation	Kollar, Thomas, Stefanie Tellex, and Nicholas Roy. "A Discriminative Model for Understanding Natural Language Route Directions." Dialog with robots: papers from the AAAI Fall Symposium, 2010.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics	en_US
dc.contributor.department	Massachusetts Institute of Technology. Media Laboratory	en_US
dc.contributor.mitauthor	Kollar, Thomas Fleming	en_US
dc.contributor.mitauthor	Tellex, Stefanie A.	en_US
dc.contributor.mitauthor	Roy, Nicholas	en_US
dc.relation.journal	Dialog with robots: papers from the AAAI Fall Symposium, 2010	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Kollar, Thomas; Tellex, Stefanie A.; Roy, Nicholas	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-8293-0492
dspace.mitauthor.error	true
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name: Roy_A discriminative model.pdf
Size: 325.4Kb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles