On the phonetic information in ultrasonic microphone signals
Author(s)
Glass, James R.; Zhu, Bo; Livescu, Karen
Publisher Policy
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Abstract
We study the phonetic information in the signal from an ultrasonic "microphone", a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-shifted signal. The ultrasonic signal can be used in addition to audio to improve automatic speech recognition. This work is an effort to better understand the ultrasonic signal, and potentially to determine a set of natural sub-word units. We present classification and clustering experiments on CVC and VCV sequences in speaker-dependent and multi-speaker settings. Using a set of ultrasonic spectral features and diagonal Gaussian models, it is possible to distinguish all consonants and most vowels. When clustering the confusion data, the consonant clusters mostly correspond to places and manners of articulation; the vowel data roughly clusters into high, low, and rounded vowels.
Date issued
2009-04
Department
Whitaker College of Health Sciences and Technology; Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Journal
IEEE International Conference on Acoustics, Speech and Signal Processing
Publisher
Institute of Electrical and Electronics Engineers
Citation
Livescu, K., Bo Zhu, and J. Glass. “On the phonetic information in ultrasonic microphone signals.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 4621-4624. © 2009, IEEE
Version: Final published version
Other identifiers
INSPEC Accession Number: 10701240
ISBN
978-1-4244-2353-8
ISSN
1520-6149