High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch

Wang, Tianyu Tom; Quatieri, Thomas F.

dc.contributor.author	Wang, Tianyu Tom
dc.contributor.author	Quatieri, Thomas F.
dc.date.accessioned	2010-04-06T21:01:56Z
dc.date.available	2010-04-06T21:01:56Z
dc.date.issued	2009-10
dc.identifier.issn	1558-7916
dc.identifier.other	INSPEC Accession Number: 10940127
dc.identifier.uri	http://hdl.handle.net/1721.1/53522
dc.description.abstract	This paper considers the problem of obtaining an accurate spectral representation of speech formant structure when the voicing source exhibits a high fundamental frequency. Our work is inspired by auditory perception and physiological studies implicating the use of pitch dynamics in speech by humans. We develop and assess signal processing schemes aimed at exploiting temporal change of pitch to address the high-pitch formant frequency estimation problem. Specifically, we propose a 2-D analysis framework using 2-D transformations of the time-frequency space. In one approach, we project changing spectral harmonics over time to a 1-D function of frequency. In a second approach, we draw upon previous work of Quatieri and Ezzat , , with similarities to the auditory modeling efforts of Chi , where localized 2-D Fourier transforms of the time-frequency space provide improved source-filter separation when pitch is changing. Our methods show quantitative improvements for synthesized vowels with stationary formant structure in comparison to traditional and homomorphic linear prediction. We also demonstrate the feasibility of applying our methods on stationary vowel regions of natural speech spoken by high-pitch females of the TIMIT corpus. Finally, we show improvements afforded by the proposed analysis framework in formant tracking on examples of stationary and time-varying formant structure.	en
dc.description.sponsorship	United States. Dept. of Defense (Air Force Contract FA8721 05 C 0002)	en
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en
dc.relation.isversionof	http://dx.doi.org/10.1109/tasl.2009.2024732	en
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en
dc.source	IEEE	en
dc.subject	temporal change of pitch	en
dc.subject	spectrotemporal analysis	en
dc.subject	linear prediction	en
dc.subject	high-pitch effects	en
dc.subject	formant estimation	en
dc.title	High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch	en
dc.type	Article	en
dc.identifier.citation	Wang, T.T., and T.F. Quatieri. “High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch.” Audio, Speech, and Language Processing, IEEE Transactions on 18.1 (2010): 171-186. © 2009 Institute of Electrical and Electronics Engineers.	en
dc.contributor.department	Harvard University--MIT Division of Health Sciences and Technology	en_US
dc.contributor.department	Harvard University--MIT Division of Health Sciences and Technology	en_US
dc.contributor.department	Lincoln Laboratory	en_US
dc.contributor.approver	Quatieri, Thomas F.
dc.contributor.mitauthor	Wang, Tianyu Tom
dc.contributor.mitauthor	Quatieri, Thomas F.
dc.relation.journal	IEEE Transactions on Audio, Speech, and Language Processing,	en
dc.eprint.version	Final published version	en
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en
eprint.status	http://purl.org/eprint/status/PeerReviewed	en
dspace.orderedauthors	Wang, T.T.; Quatieri, T.F.	en
mit.license	PUBLISHER_POLICY	en
mit.metadata.status	Complete

Files in this item

Name:: Wang-2010-High-Pitch Formant E.pdf
Size:: 2.389Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record