How do we interpret the outputs of a neural network trained on classification?

Xie, Yudi

dc.contributor.author	Xie, Yudi
dc.date.accessioned	2025-04-02T17:13:49Z
dc.date.available	2025-04-02T17:13:49Z
dc.date.issued	2025-04-28
dc.identifier.uri	https://hdl.handle.net/1721.1/159032
dc.description	Blogposts Track. ICLR 2025, 24-28 April, Singapore.	en_US
dc.description.abstract	Deep neural networks are widely used for classification tasks, but the interpretation of their output activations is often unclear. This tutorial article explains how these outputs can be understood as approximations of the Bayesian posterior. We show that, in theory, the loss function for classification tasks, derived by maximum likelihood, is minimized by the Bayesian posterior. We then report empirical studies in which neural networks are trained to classify synthetic data from a known generative model. In a simple classification task, the network closely approximates the theoretically derived posterior. However, a few changes to the task can make accurate approximation much more difficult. The ability of the networks to approximate the posterior depends on multiple factors, such as the complexity of the posterior and whether there is sufficient data for learning.	en_US
dc.publisher	International Conference on Learning Representations	en_US
dc.rights	Creative Commons Attribution-Noncommercial-ShareAlike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	Author	en_US
dc.title	How do we interpret the outputs of a neural network trained on classification?	en_US
dc.type	Article	en_US
dc.identifier.citation	Xie, Yudi. 2025. "How do we interpret the outputs of a neural network trained on classification?"
dc.contributor.department	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferenceItem	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.date.submission	2025-03-24T15:52:08Z
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US
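
The abstract's central claim, that the maximum-likelihood classification loss is minimized by the Bayesian posterior, can be checked numerically on a toy problem. The sketch below is not taken from the article: the two-class Gaussian generative model, its parameters, and the function names `bayes_posterior` and `expected_cross_entropy` are all assumptions chosen for illustration. It fixes one input, computes the true posterior from the known generative model, and then searches a grid of candidate outputs for the one with the lowest expected cross-entropy.

```python
import math


def bayes_posterior(x, mu0=-1.0, mu1=1.0, sigma=1.0, prior1=0.5):
    """P(y=1 | x) for two equal-variance Gaussian classes.

    Hypothetical toy generative model; the parameters are illustrative
    assumptions, not values from the article.
    """
    def gauss(value, mu):
        z = (value - mu) / sigma
        return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

    joint1 = prior1 * gauss(x, mu1)          # p(x, y=1)
    joint0 = (1.0 - prior1) * gauss(x, mu0)  # p(x, y=0)
    return joint1 / (joint0 + joint1)        # Bayes' rule


def expected_cross_entropy(q, p):
    """Expected cross-entropy at one input when the true P(y=1 | x) is p
    and the classifier outputs probability q for class 1."""
    return -(p * math.log(q) + (1.0 - p) * math.log(1.0 - q))


# True posterior at a single input, from the known generative model.
p = bayes_posterior(0.5)

# Candidate outputs on a grid in (0, 1); pick the one with the lowest loss.
candidates = [i / 1000 for i in range(1, 1000)]
best_q = min(candidates, key=lambda q: expected_cross_entropy(q, p))

print(f"true posterior:  {p:.4f}")
print(f"loss-minimizing output on the grid: {best_q:.3f}")
```

The grid search stands in for training: a network with enough capacity and data is pushed toward the same minimizer at every input, which is why its outputs can be read as posterior estimates. Whether a real network gets there depends on the factors the abstract names, such as posterior complexity and the amount of data.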

Files in this item

Name: YudiXie_How_do_we_interpret_th ...
Size: 2.522Mb
Format: PDF


This item appears in the following Collection(s)

MIT Open Access Articles
