An Information Theoretic Interpretation to Deep Neural Networks

Huang, Shao-Lun; Xu, Xiangxiang; Zheng, Lizhong; Wornell, Gregory W

dc.contributor.author	Huang, Shao-Lun
dc.contributor.author	Xu, Xiangxiang
dc.contributor.author	Zheng, Lizhong
dc.contributor.author	Wornell, Gregory W
dc.date.accessioned	2021-11-09T16:06:15Z
dc.date.available	2021-11-09T16:06:15Z
dc.date.issued	2019-07
dc.identifier.uri	https://hdl.handle.net/1721.1/137944
dc.description.abstract	© 2019 IEEE. It is commonly believed that the hidden layers of deep neural networks (DNNs) attempt to extract informative features for learning tasks. In this paper, we formalize this intuition by showing that the features extracted by DNN coincide with the result of an optimization problem, which we call the "universal feature selection" problem, in a local analysis regime. We interpret the weights training in DNN as the projection of feature functions between feature spaces, specified by the network structure. Our formulation has direct operational meaning in terms of the performance for inference tasks, and gives interpretations to the internal computation results of DNNs. Results of numerical experiments are provided to support the analysis.	en_US
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	10.1109/ISIT.2019.8849720	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	An Information Theoretic Interpretation to Deep Neural Networks	en_US
dc.type	Article	en_US
dc.identifier.citation	Huang, Shao-Lun, Xu, Xiangxiang, Zheng, Lizhong and Wornell, Gregory W. 2019. "An Information Theoretic Interpretation to Deep Neural Networks." IEEE International Symposium on Information Theory - Proceedings, 2019-July.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journal	IEEE International Symposium on Information Theory - Proceedings	en_US
dc.eprint.version	Original manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2021-01-25T18:23:31Z
dspace.orderedauthors	Huang, S-L; Xu, X; Zheng, L; Wornell, GW	en_US
dspace.date.submission	2021-01-25T18:23:34Z
mit.journal.volume	2019-July	en_US
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 1905.06600.pdf
Size:: 681.7Kb
Format:: PDF
Description:: Submitted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record