dc.contributor.author | Huang, Shao-Lun | |
dc.contributor.author | Xu, Xiangxiang | |
dc.contributor.author | Zheng, Lizhong | |
dc.contributor.author | Wornell, Gregory W | |
dc.date.accessioned | 2021-11-09T16:06:15Z | |
dc.date.available | 2021-11-09T16:06:15Z | |
dc.date.issued | 2019-07 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/137944 | |
dc.description.abstract | © 2019 IEEE. It is commonly believed that the hidden layers of deep neural networks (DNNs) attempt to extract informative features for learning tasks. In this paper, we formalize this intuition by showing that the features extracted by DNN coincide with the result of an optimization problem, which we call the "universal feature selection" problem, in a local analysis regime. We interpret the weights training in DNN as the projection of feature functions between feature spaces, specified by the network structure. Our formulation has direct operational meaning in terms of the performance for inference tasks, and gives interpretations to the internal computation results of DNNs. Results of numerical experiments are provided to support the analysis. | en_US |
dc.language.iso | en | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | 10.1109/ISIT.2019.8849720 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | arXiv | en_US |
dc.title | An Information Theoretic Interpretation to Deep Neural Networks | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Huang, Shao-Lun, Xu, Xiangxiang, Zheng, Lizhong and Wornell, Gregory W. 2019. "An Information Theoretic Interpretation to Deep Neural Networks." IEEE International Symposium on Information Theory - Proceedings, 2019-July. | |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.relation.journal | IEEE International Symposium on Information Theory - Proceedings | en_US |
dc.eprint.version | Original manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2021-01-25T18:23:31Z | |
dspace.orderedauthors | Huang, S-L; Xu, X; Zheng, L; Wornell, GW | en_US |
dspace.date.submission | 2021-01-25T18:23:34Z | |
mit.journal.volume | 2019-July | en_US |
mit.license | OPEN_ACCESS_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |