Segmentation and Alignment of Speech and Sketching in a Design Environment

Adler, Aaron D.

Author(s)

Adler, Aaron D.

DownloadAITR-2003-004.ps (32.83Mb)

Additional downloads

AITR-2003-004.pdf (44.01Mb)

Metadata

Show full item record

Abstract

Sketches are commonly used in the early stages of design. Our previous system allows users to sketch mechanical systems that the computer interprets. However, some parts of the mechanical system might be too hard or too complicated to express in the sketch. Adding speech recognition to create a multimodal system would move us toward our goal of creating a more natural user interface. This thesis examines the relationship between the verbal and sketch input, particularly how to segment and align the two inputs. Toward this end, subjects were recorded while they sketched and talked. These recordings were transcribed, and a set of rules to perform segmentation and alignment was created. These rules represent the knowledge that the computer needs to perform segmentation and alignment. The rules successfully interpreted the 24 data sets that they were given.

Date issued

2003-02-01

URI

http://hdl.handle.net/1721.1/7103

Other identifiers

AITR-2003-004

Series/Report no.

AITR-2003-004

Keywords

AI, sketch, design, multimodal, disambiguation, segmentation, alignment

Collections

AI Technical Reports (1964 - 2004)