Show simple item record

dc.contributor.advisorRobert E. Hillman and Kenneth N. Stevens.en_US
dc.contributor.authorMeltzner, Geoffrey S. (Geoffrey Seth), 1973-en_US
dc.contributor.otherHarvard University--MIT Division of Health Sciences and Technology.en_US
dc.date.accessioned2006-03-24T16:23:03Z
dc.date.available2006-03-24T16:23:03Z
dc.date.copyright2003en_US
dc.date.issued2003en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/29760
dc.descriptionThesis (Ph. D.)--Harvard-MIT Division of Health Sciences and Technology, 2003.en_US
dc.descriptionIncludes bibliographical references (p. 167-171).en_US
dc.description.abstractAdvanced laryngeal cancer is often treated by surgical removal of the larynx (laryngectomy) thus rendering patients unable to produce normal voice and speech. Laryngectomy patients must rely on an alternative means of producing voice and speech, with the most common method being the use of an electrolarynx (EL). The EL is a small, hand-held, electromechanical device that acoustically excites the vocal tract when held against the neck or at the lips. While the EL provides a serviceable means of communication, the resulting speech has several shortcomings in terms of both intelligibility and speech quality. Previous studies have identified and tried to correct different single selected acoustic properties associated with the abnormal quality of EL speech, but with only limited success. There remains uncertainty about: 1) which components of the EL speech acoustic signal are contributing most to its abnormal quality and 2) what kinds of acoustic enhancements would be most effective in improving the quality of EL speech. Using a combination of listening experiments, acoustic analysis and acoustic modeling, this thesis investigated the perceptual and acoustic impacts of several aberrant properties of EL speech, with the overall goal of using the results to direct future EL speech improvement efforts. Perceptual experiments conducted by having 10 listeners judge the naturalness of differently enhanced versions of EL speech demonstrated that adding pitch information would produce the most benefit. Removing the EL self-noise and correcting for a lack of low frequency energy would also improve EL speech, but to a lesser extent. However,en_US
dc.description.abstract(cont.) this study also demonstrated that monotonous, normal speech was found to be more natural than any version of EL speech, indicating that there are other abnormal properties of EL speech contributing to its unnatural quality. An acoustic analysis of a corpus of pre- and post-laryngectomy speech revealed that changes in vocal tract anatomy produce narrower formant bandwidths and spectral zeros that alter the spectral properties of EL speech. Vocal tract modeling confirmed that these spectral zeros are a function of EL placement and thus their effects will vary from user to user. Even though the addition of pitch information was associated with the greatest improvement in EL speech quality, its implementation is not currently possible because it would require access to underlying linguistic and/or neural processes. Based on these findings it was concluded that an enhancement algorithm that corrects for the low frequency deficit, the interference of the EL self-noise, the narrower formant bandwidths, and the effect of the source location, should produce EL speech whose quality surpasses what is currently available.en_US
dc.description.statementofresponsibilityby Geoffrey Seth Meltzner.en_US
dc.format.extent171 p.en_US
dc.format.extent12840980 bytes
dc.format.extent12840788 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypeapplication/pdf
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582
dc.subjectHarvard University--MIT Division of Health Sciences and Technology.en_US
dc.titlePerceptual and acoustic impacts of aberrant properties of electrolaryngeal speechen_US
dc.typeThesisen_US
dc.description.degreePh.D.en_US
dc.contributor.departmentHarvard University--MIT Division of Health Sciences and Technology
dc.identifier.oclc54665804en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record