dc.contributor.author | Xie, Yudi | |
dc.date.accessioned | 2025-04-02T17:13:49Z | |
dc.date.available | 2025-04-02T17:13:49Z | |
dc.date.issued | 2025-04-28 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/159032 | |
dc.description | Blogposts Track. ICLR 2025, 24-28 April, Singapore. | en_US |
dc.description.abstract | Deep neural networks are widely used for classification tasks, but the interpretation of their output activations is often unclear. This tutorial article explains
how these outputs can be understood as approximations of the Bayesian posterior.
We showed that, in theory, the loss function for classification tasks – derived by
maximum likelihood – is minimized by the Bayesian posterior. We conducted
empirical studies training neural networks to classify synthetic data from a known
generative model. In a simple classification task, the network closely approximates the theoretically derived posterior. However, a few changes in the task can
make accurate approximation much more difficult. The ability of the networks to
approximate the posterior depends on multiple factors, such as the complexity of
the posterior and whether there is sufficient data for learning. | en_US |
dc.publisher | International Conference on Learning Representations | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-ShareAlike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | Author | en_US |
dc.title | How do we interpret the outputs of a neural network trained on classification? | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Xie, Yudi. 2025. "How do we interpret the outputs of a neural network trained on classification?." | |
dc.contributor.department | Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferenceItem | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.date.submission | 2025-03-24T15:52:08Z | |
mit.license | OPEN_ACCESS_POLICY | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |