Noise reduction algorithms and performance metrics for improving speech reception in noise by cochlear-implant users
Author(s)
Goldsworthy, Raymond Lee, 1974-
DownloadFull printable version (16.06Mb)
Other Contributors
Harvard University--MIT Division of Health Sciences and Technology.
Advisor
Julie E. Greenberg.
Terms of use
Metadata
Show full item recordAbstract
This thesis addresses the design and evaluation of algorithms to improve speech reception for cochlear-implant (CI) users in adverse listening environments. We develop and assess performance metrics for use in the algorithm design process; such metrics make algorithm evaluation efficient, consistent, and subject independent. One promising performance metric is the Speech Transmission Index (STI), which is well correlated with speech reception by normal-hearing listeners for additive noise and reverberation. We expect the STI will effectively predict speech reception by CI users since typical CI sound-processing strategies, like the STI, rely on the envelope signals in frequency bands spanning the speech spectrum. However, STI-based metrics have proven unsatisfactory for assessing the effects of nonlinear operations on the intelligibility of processed speech. In this work we consider modifications to the STI that account for nonlinear operations commonly found in CI sound-processing and noise reduction algorithms. We consider a number of existing speech-based STI metrics and propose novel metrics applicable to nonlinear operations. A preliminary evaluation results in the selection of three candidate metrics for extensive evaluation. In four central experiments, we consider the effects of acoustic degradation, N-of-M processing, spectral subtraction, and binaural noise reduction on the intelligibility of CI-processed speech. We assess the ability of the candidate metrics to predict speech reception scores. (cont.) Subjects include CI users as well as normal-hearing subjects listening to a noise-vocoder simulation of CI sound-processing. Our results show that: 1) both spectral subtraction and binaural noise reduction improve the intelligibility of CI-processed speech and 2) of the candidate metrics, one method (the normalized correlation metric) consistently predicts the major trends in speech reception scores for all four experiments.
Description
Thesis (Ph. D.)--Harvard University--MIT Division of Health Sciences and Technology, 2005. Includes bibliographical references (p. 229-233).
Date issued
2005Department
Harvard University--MIT Division of Health Sciences and TechnologyPublisher
Massachusetts Institute of Technology
Keywords
Harvard University--MIT Division of Health Sciences and Technology.