Show simple item record

dc.contributor.advisorV. Michael Bove, Jr.en_US
dc.contributor.authorLi, Yien_US
dc.contributor.otherMassachusetts Institute of Technology. Dept. of Architecture. Program in Media Arts and Sciences.en_US
dc.date.accessioned2005-05-19T15:11:37Z
dc.date.available2005-05-19T15:11:37Z
dc.date.copyright2002en_US
dc.date.issued2002en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/16894
dc.descriptionThesis (S.M.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 2002.en_US
dc.descriptionIncludes bibliographical references (p. 65-66).en_US
dc.descriptionThis electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.en_US
dc.description.abstractWe developed VoiceLink, a speech interface package for responsive media applications. It contains a set of speech interface modules that can interface with various multimedia applications written in Isis, a scripting programming language created at the MIT Media Laboratory. Specifically, we designed two command-and-control voice interfaces, one for iCom, a multi-point audio/video communication system, and another for HyperSoap, a hyperlinked TV program. The iCom module enables users to control an iCom station using voice commands while the HyperSoap module allows viewers to select objects and access related information by saying objects' names. We also built a speech software library for Isis, which allows users to develop speech aware applications in the Isis programming environment. We addressed a number of problems when designing VoiceLink. In the case of the iCom module, visual information is used to seamlessly inform users of voice commands and to provide them with instant feedback and instructions, making the speech interface intuitive, flexible and easy to use for novice users. The major challenge for the HyperSoap module is the open vocabulary problem for object selection. In our design, an item list is displayed on the screen upon viewers' request to show them selectable objects. We also created an object name index to model how viewers may call objects spontaneously. Using a combination of item list and name index in the HyperSoap module produced fairly robust performance, making the speech interface a useful alternative to traditional pointing devices. The result of user evaluation is encouraging. It showed that a speech based interface for responsive media applications is not only useful but also practical.en_US
dc.description.statementofresponsibilityby Yi Li.en_US
dc.format.extent66 p.en_US
dc.format.extent1125547 bytes
dc.format.extent1125297 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypeapplication/pdf
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582
dc.subjectArchitecture. Program in Media Arts and Sciences.en_US
dc.titleVoiceLink : a speech interface fore responsive mediaen_US
dc.title.alternativeVoice Link : a speech interface fore responsive mediaen_US
dc.typeThesisen_US
dc.description.degreeS.M.en_US
dc.contributor.departmentProgram in Media Arts and Sciences (Massachusetts Institute of Technology)
dc.identifier.oclc52005276en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record