Towards co-channel speaker separation BY 2-D demodulation of spectrograms

Wang, Tianyu T.; Quatieri, Thomas F.

dc.contributor.author	Wang, Tianyu Tom
dc.contributor.author	Quatieri, Thomas F.
dc.date.accessioned	2012-07-25T15:16:01Z
dc.date.available	2012-07-25T15:16:01Z
dc.date.issued	2009-12
dc.date.submitted	2009-10
dc.identifier.isbn	978-1-4244-3679-8
dc.identifier.isbn	978-1-4244-3678-1
dc.identifier.issn	1931-1168
dc.identifier.uri	http://hdl.handle.net/1721.1/71798
dc.description.abstract	This paper explores a two-dimensional (2-D) processing approach for co-channel speaker separation of voiced speech. We analyze localized time-frequency regions of a narrowband spectrogram using 2-D Fourier transforms and propose a 2-D amplitude modulation model based on pitch information for single and multi-speaker content in each region. Our model maps harmonically-related speech content to concentrated entities in a transformed 2-D space, thereby motivating 2-D demodulation of the spectrogram for analysis/synthesis and speaker separation. Using a priori pitch estimates of individual speakers, we show through a quantitative evaluation: 1) Utility of the model for representing speech content of a single speaker and 2) Its feasibility for speaker separation. For the separation task, we also illustrate benefits of the model's representation of pitch dynamics relative to a sinusoidal-based separation system.	en_US
dc.description.sponsorship	United States. Dept. of Defense. Air Force (Contract FA8721-05-C-0002)	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/ASPAA.2009.5346526	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	IEEE	en_US
dc.title	Towards co-channel speaker separation BY 2-D demodulation of spectrograms	en_US
dc.type	Article	en_US
dc.identifier.citation	Wang, Tianyu T., and Thomas F. Quatieri. “Towards Co-channel Speaker Separation BY 2-D Demodulation of Spectrograms.” IEEE, 2009. 65–68. © Copyright 2009 IEEE	en_US
dc.contributor.department	Lincoln Laboratory	en_US
dc.contributor.approver	Quatieri, Thomas F.
dc.contributor.mitauthor	Wang, Tianyu Tom
dc.contributor.mitauthor	Quatieri, Thomas F.
dc.relation.journal	IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009.	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
dspace.orderedauthors	Wang, Tianyu T.; Quatieri, Thomas F.	en
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Wang-2009-Towards co-channel ...
Size:: 2.897Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record