Show simple item record

dc.contributor.author: Glass, James R.
dc.contributor.author: Zhu, Bo
dc.contributor.author: Livescu, Karen
dc.date.accessioned: 2010-11-05T22:05:57Z
dc.date.available: 2010-11-05T22:05:57Z
dc.date.issued: 2009-04
dc.identifier.isbn: 978-1-4244-2353-8
dc.identifier.issn: 1520-6149
dc.identifier.other: INSPEC Accession Number: 10701240
dc.identifier.uri: http://hdl.handle.net/1721.1/59850
dc.description.abstract: We study the phonetic information in the signal from an ultrasonic "microphone", a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-shifted signal. This can be used in addition to audio to improve automatic speech recognition. This work is an effort to better understand the ultrasonic signal, and potentially to determine a set of natural sub-word units. We present classification and clustering experiments on CVC and VCV sequences in speaker-dependent and multi-speaker settings. Using a set of ultrasonic spectral features and diagonal Gaussian models, it is possible to distinguish all consonants and most vowels. When clustering the confusion data, the consonant clusters mostly correspond to places and manners of articulation; the vowel data roughly clusters into high, low, and rounded vowels. [en_US]
dc.description.sponsorship: National Institutes of Health (U.S.) (Training Grant T32 DC000038) [en_US]
dc.description.sponsorship: Henry Luce Foundation. Clare Boothe Luce Fellowship [en_US]
dc.language.iso: en_US
dc.publisher: Institute of Electrical and Electronics Engineers [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1109/ICASSP.2009.4960660 [en_US]
dc.rights: Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. [en_US]
dc.source: IEEE [en_US]
dc.title: On the phonetic information in ultrasonic microphone signals [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Livescu, K., Bo Zhu, and J. Glass. “On the phonetic information in ultrasonic microphone signals.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 4621-4624. © 2009, IEEE [en_US]
dc.contributor.department: Whitaker College of Health Sciences and Technology [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory [en_US]
dc.contributor.approver: Glass, James R.
dc.contributor.mitauthor: Glass, James R.
dc.contributor.mitauthor: Zhu, Bo
dc.relation.journal: IEEE International Conference on Acoustics, Speech and Signal Processing [en_US]
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/JournalArticle [en_US]
eprint.status: http://purl.org/eprint/status/PeerReviewed [en_US]
dspace.orderedauthors: Livescu, Karen; Zhu, Bo; Glass, James [en]
dc.identifier.orcid: https://orcid.org/0000-0002-3097-360X
dc.identifier.orcid: https://orcid.org/0000-0002-0958-1783
mit.license: PUBLISHER_POLICY [en_US]
mit.metadata.status: Complete

