dc.contributor.advisor | Torralba, Antonio | |
dc.contributor.author | Elango, Mahalaxmi | |
dc.date.accessioned | 2022-01-14T14:52:58Z | |
dc.date.available | 2022-01-14T14:52:58Z | |
dc.date.issued | 2021-06 | |
dc.date.submitted | 2021-06-17T20:13:11.250Z | |
dc.identifier.uri | https://hdl.handle.net/1721.1/139151 | |
dc.description.abstract | Observations of various deep neural network architectures indicate that deep networks may be spontaneously learning representations of concepts with semantic meaning, and encoding a relational structure or rule between these concepts. We refer to these encoded relationships between concepts in the network as rules. In classifiers, we rewrite an existing rule in the network as desired, referred to as the rewriting technique.
We demonstrate that using our rewriting technique and simple human knowledge about how to classify the world around us, we can generalize existing classes to unseen variants, identify spurious correlations present in the dataset, mitigate the effects of spurious correlations, and introduce new classes. We find that our technique reduces the need for: computing resources, because we only re-train a single layer’s weights; new training images, because our rewriting technique can rewrite using concepts already encoded in the network; and domain knowledge, because what we choose to edit to improve classification is derived from logical rules a human would construct to classify images. | |
dc.publisher | Massachusetts Institute of Technology | |
dc.rights | In Copyright - Educational Use Permitted | |
dc.rights | Copyright MIT | |
dc.rights.uri | http://rightsstatements.org/page/InC-EDU/1.0/ | |
dc.title | Rewriting the Rules of a Classifier | |
dc.type | Thesis | |
dc.description.degree | M.Eng. | |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
mit.thesis.degree | Master | |
thesis.degree.name | Master of Engineering in Electrical Engineering and Computer Science | |