A discriminative model for understanding natural language route directions

Kollar, Thomas; Tellex, Stefanie A.; Roy, Nicholas

Author(s)

Kollar, Thomas Fleming; Tellex, Stefanie A.; Roy, Nicholas

DownloadRoy_A discriminative model.pdf (325.4Kb)

OPEN_ACCESS_POLICY

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike 3.0 http://creativecommons.org/licenses/by-nc-sa/3.0/

Metadata

Show full item record

Abstract

To be useful teammates to human partners, robots must be able to follow spoken instructions given in natural language. However, determining the correct sequence of actions in response to a set of spoken instructions is a complex decision-making problem. There is a "semantic gap" between the high-level symbolic models of the world that people use, and the low-level models of geometry, state dynamics, and perceptions that robots use. In this paper, we show how this gap can be bridged by inferring the best sequence of actions from a linguistic description and environmental features. This work improves upon previous work in three ways. First, by using a conditional random field (CRF), we learn the relative weight of environmental and linguistic features, enabling the system to learn the meanings of words and reducing the modeling effort in learning how to follow commands. Second, a number of long-range features are added, which help the system to use additional structure in the problem. Finally, given a natural language command, we infer both the referred path and landmark directly, thereby requiring the algorithm to pick a landmark by which it should navigate. The CRF is demonstrated to have 15% error on a held-out dataset, when compared with 39% error for a Markov random field (MRF). Finally, by analyzing the additional annotations necessary for this work, we find that natural language route directions map sequentially onto the corresponding path and landmarks 99.6% of the time. In addition, the size of the referred landmark varies from 0m[superscript 2] to 1964m[superscript 2] and the length of the referred path varies from 0m to 40.83m.

Date issued

2010

URI

http://hdl.handle.net/1721.1/81437

Department

Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Aeronautics and Astronautics; Massachusetts Institute of Technology. Media Laboratory

Journal

Dialog with robots: papers from the AAAI Fall Symposium, 2010

Publisher

American Association for Artificial Intelligence

Citation

Kollar, Thomas, Stefanie Tellex, and Nicholas Roy. "A Discriminative Model for Understanding Natural Language Route Directions." Dialog with robots: papers from the AAAI Fall Symposium, 2010.

Version: Author's final manuscript

Other identifiers

Technical report ; FS-10-05

ISBN

9781577354871

1577354877

Collections

MIT Open Access Articles

DSpace@MIT