Show simple item record

dc.contributor.authorGlass, James R.
dc.contributor.authorZhang, Yaodong, Ph. D. Massachusetts Institute of Technology
dc.date.accessioned2010-12-06T23:03:25Z
dc.date.available2010-12-06T23:03:25Z
dc.date.issued2009-04
dc.identifier.isbn978-1-4244-2353-8
dc.identifier.issn1520-6149
dc.identifier.otherINSPEC Accession Number: 10701647
dc.identifier.urihttp://hdl.handle.net/1721.1/60218
dc.description.abstractIn this paper, we present a novel speech-rhythm-guided syllable-nuclei location detection algorithm. As a departure from conventional methods, we introduce an instantaneous speech rhythm estimator to predict possible regions where syllable nuclei can appear. Within a possible region, a simple slope based peak counting algorithm is used to get the exact location of each syllable nucleus. We verify the correctness of our method by investigating the syllable nuclei interval distribution in TIMIT dataset, and evaluate the performance by comparing with a state-of-the-art syllable nuclei based speech rate detection approach.en_US
dc.language.isoen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/ICASSP.2009.4960454en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceIEEEen_US
dc.titleSpeech rhythm guided syllable nuclei detectionen_US
dc.typeArticleen_US
dc.identifier.citationYaodong Zhang, and J.R. Glass. “Speech rhythm guided syllable nuclei detection.” Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on. 2009. 3797-3800. © 2009, IEEEen_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverGlass, James R.
dc.contributor.mitauthorGlass, James R.
dc.contributor.mitauthorZhang, Yaodong
dc.relation.journalIEEE International Conference on Acoustics, Speech and Signal Processingen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsZhang, Yaodong; Glass, James R.en
dc.identifier.orcidhttps://orcid.org/0000-0002-3097-360X
dspace.mitauthor.errortrue
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record