dc.contributor.author | Wang, Tianyu Tom | |
dc.contributor.author | Quatieri, Thomas F. | |
dc.date.accessioned | 2012-07-25T15:16:01Z | |
dc.date.available | 2012-07-25T15:16:01Z | |
dc.date.issued | 2009-12 | |
dc.date.submitted | 2009-10 | |
dc.identifier.isbn | 978-1-4244-3679-8 | |
dc.identifier.isbn | 978-1-4244-3678-1 | |
dc.identifier.issn | 1931-1168 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/71798 | |
dc.description.abstract | This paper explores a two-dimensional (2-D) processing approach for co-channel speaker separation of voiced speech. We analyze localized time-frequency regions of a narrowband spectrogram using 2-D Fourier transforms and propose a 2-D amplitude modulation model based on pitch information for single and multi-speaker content in each region. Our model maps harmonically-related speech content to concentrated entities in a transformed 2-D space, thereby motivating 2-D demodulation of the spectrogram for analysis/synthesis and speaker separation. Using a priori pitch estimates of individual speakers, we show through a quantitative evaluation: 1) Utility of the model for representing speech content of a single speaker and 2) Its feasibility for speaker separation. For the separation task, we also illustrate benefits of the model's representation of pitch dynamics relative to a sinusoidal-based separation system. | en_US |
dc.description.sponsorship | United States. Dept. of Defense. Air Force (Contract FA8721-05-C-0002) | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/ASPAA.2009.5346526 | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | IEEE | en_US |
dc.title | Towards co-channel speaker separation BY 2-D demodulation of spectrograms | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Wang, Tianyu T., and Thomas F. Quatieri. “Towards Co-channel Speaker Separation BY 2-D Demodulation of Spectrograms.” IEEE, 2009. 65–68. © Copyright 2009 IEEE | en_US |
dc.contributor.department | Lincoln Laboratory | en_US |
dc.contributor.approver | Quatieri, Thomas F. | |
dc.contributor.mitauthor | Wang, Tianyu Tom | |
dc.contributor.mitauthor | Quatieri, Thomas F. | |
dc.relation.journal | IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009. | en_US |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
dspace.orderedauthors | Wang, Tianyu T.; Quatieri, Thomas F. | en |
mit.license | PUBLISHER_POLICY | en_US |
mit.metadata.status | Complete | |