Show simple item record

dc.contributor.authorWang, Tianyu Tom
dc.contributor.authorQuatieri, Thomas F.
dc.date.accessioned2012-07-25T15:16:01Z
dc.date.available2012-07-25T15:16:01Z
dc.date.issued2009-12
dc.date.submitted2009-10
dc.identifier.isbn978-1-4244-3679-8
dc.identifier.isbn978-1-4244-3678-1
dc.identifier.issn1931-1168
dc.identifier.urihttp://hdl.handle.net/1721.1/71798
dc.description.abstractThis paper explores a two-dimensional (2-D) processing approach for co-channel speaker separation of voiced speech. We analyze localized time-frequency regions of a narrowband spectrogram using 2-D Fourier transforms and propose a 2-D amplitude modulation model based on pitch information for single and multi-speaker content in each region. Our model maps harmonically-related speech content to concentrated entities in a transformed 2-D space, thereby motivating 2-D demodulation of the spectrogram for analysis/synthesis and speaker separation. Using a priori pitch estimates of individual speakers, we show through a quantitative evaluation: 1) Utility of the model for representing speech content of a single speaker and 2) Its feasibility for speaker separation. For the separation task, we also illustrate benefits of the model's representation of pitch dynamics relative to a sinusoidal-based separation system.en_US
dc.description.sponsorshipUnited States. Dept. of Defense. Air Force (Contract FA8721-05-C-0002)en_US
dc.language.isoen_US
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/ASPAA.2009.5346526en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceIEEEen_US
dc.titleTowards co-channel speaker separation BY 2-D demodulation of spectrogramsen_US
dc.typeArticleen_US
dc.identifier.citationWang, Tianyu T., and Thomas F. Quatieri. “Towards Co-channel Speaker Separation BY 2-D Demodulation of Spectrograms.” IEEE, 2009. 65–68. © Copyright 2009 IEEEen_US
dc.contributor.departmentLincoln Laboratoryen_US
dc.contributor.approverQuatieri, Thomas F.
dc.contributor.mitauthorWang, Tianyu Tom
dc.contributor.mitauthorQuatieri, Thomas F.
dc.relation.journalIEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009.en_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsWang, Tianyu T.; Quatieri, Thomas F.en
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record