Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/137109.2

Show simple item record

dc.contributor.authorAnanthabhotla, I
dc.contributor.authorEwert, S
dc.contributor.authorParadiso, JA
dc.date.accessioned2021-11-02T16:57:11Z
dc.date.available2021-11-02T16:57:11Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/137109
dc.description.abstract© 2020 IEEE. A growing need for on-device machine learning has led to an increased interest in light-weight neural networks that lower model complexity while retaining performance. While a variety of general-purpose techniques exist in this context, very few approaches exploit domain-specific properties to further improve upon the capacity-performance trade-off. In this paper, extending our prior work [1], we train a network to emulate the behaviour of an audio codec and use this network to construct a loss. By approximating the psychoacoustic model underlying the codec, our approach enables light-weight neural networks to focus on perceptually relevant properties without wasting their limited capacity on imperceptible signal components. We adapt our method to two audio source separation tasks, demonstrate an improvement in performance for small-scale networks via listening tests, characterize the behaviour of the loss network in detail, and quantify the relationship between performance gain and model capacity. Our work illustrates the potential for incorporating perceptual principles into objective functions for neural networks.en_US
dc.language.isoen
dc.publisherIEEEen_US
dc.relation.isversionof10.1109/IJCNN48605.2020.9207053en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleUsing a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networksen_US
dc.typeArticleen_US
dc.identifier.citationAnanthabhotla, I, Ewert, S and Paradiso, JA. 2020. "Using a Neural Network Codec Approximation Loss to Improve Source Separation Performance in Limited Capacity Networks." Proceedings of the International Joint Conference on Neural Networks.
dc.relation.journalProceedings of the International Joint Conference on Neural Networksen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-06-25T17:55:19Z
dspace.orderedauthorsAnanthabhotla, I; Ewert, S; Paradiso, JAen_US
dspace.date.submission2021-06-25T17:55:21Z
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version