Fine-tuning generative models
Author(s)
Khandelwal, Arjun(Arjun Sunil)
Download1145123030-MIT.pdf (1014.Kb)
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
David Sontag.
Terms of use
Metadata
Show full item recordAbstract
Deep generative models have emerged as a powerful modeling paradigm for making sense of large amounts of unlabeled real-world data. In particular, the representations produced by these models have proven to be useful both in improving human understanding of the factors of variation in the original dataset and in downstream tasks such as classification. Most current algorithms, however, require training a bespoke model from scratch, which can be both expensive and time-consuming. Instead, we propose various methods of fine-tuning pre-trained generative models to achieve these goals, and evaluate these methods quantitatively on few-shot classification and interpretability tasks.
Description
This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2019 Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 73-76).
Date issued
2019Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.