dc.contributor.advisor | V. Michael Bove, Jr. | en_US |
dc.contributor.author | Li, Yi | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Dept. of Architecture. Program in Media Arts and Sciences. | en_US |
dc.date.accessioned | 2005-05-19T15:11:37Z | |
dc.date.available | 2005-05-19T15:11:37Z | |
dc.date.copyright | 2002 | en_US |
dc.date.issued | 2002 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/16894 | |
dc.description | Thesis (S.M.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 2002. | en_US |
dc.description | Includes bibliographical references (p. 65-66). | en_US |
dc.description | This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. | en_US |
dc.description.abstract | We developed VoiceLink, a speech interface package for responsive media applications. It contains a set of speech interface modules that can interface with various multimedia applications written in Isis, a scripting programming language created at the MIT Media Laboratory. Specifically, we designed two command-and-control voice interfaces, one for iCom, a multi-point audio/video communication system, and another for HyperSoap, a hyperlinked TV program. The iCom module enables users to control an iCom station using voice commands while the HyperSoap module allows viewers to select objects and access related information by saying objects' names. We also built a speech software library for Isis, which allows users to develop speech aware applications in the Isis programming environment. We addressed a number of problems when designing VoiceLink. In the case of the iCom module, visual information is used to seamlessly inform users of voice commands and to provide them with instant feedback and instructions, making the speech interface intuitive, flexible and easy to use for novice users. The major challenge for the HyperSoap module is the open vocabulary problem for object selection. In our design, an item list is displayed on the screen upon viewers' request to show them selectable objects. We also created an object name index to model how viewers may call objects spontaneously. Using a combination of item list and name index in the HyperSoap module produced fairly robust performance, making the speech interface a useful alternative to traditional pointing devices. The result of user evaluation is encouraging. It showed that a speech based interface for responsive media applications is not only useful but also practical. | en_US |
dc.description.statementofresponsibility | by Yi Li. | en_US |
dc.format.extent | 66 p. | en_US |
dc.format.extent | 1125547 bytes | |
dc.format.extent | 1125297 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | |
dc.subject | Architecture. Program in Media Arts and Sciences. | en_US |
dc.title | VoiceLink : a speech interface fore responsive media | en_US |
dc.title.alternative | Voice Link : a speech interface fore responsive media | en_US |
dc.type | Thesis | en_US |
dc.description.degree | S.M. | en_US |
dc.contributor.department | Program in Media Arts and Sciences (Massachusetts Institute of Technology) | |
dc.identifier.oclc | 52005276 | en_US |