Show simple item record

dc.contributor.advisor	Uhler, Caroline
dc.contributor.author	Prasad, Neha
dc.date.accessioned	2022-01-14T14:46:51Z
dc.date.available	2022-01-14T14:46:51Z
dc.date.issued	2021-06
dc.date.submitted	2021-06-17T20:14:05.349Z
dc.identifier.uri	https://hdl.handle.net/1721.1/139050
dc.description.abstract	We theoretically and empirically analyze the phenomenon of transfer learning in over-parameterized machine learning. We start by showing that in over-parameterized linear regression, transfer learning is equivalent to solving regression from a non-zero initialization. We use this finding to propose LLBoost, a theoretically grounded, computationally efficient method to boost the validation and test accuracy of pretrained, over-parameterized models without impacting the original training accuracy. We evaluate LLBoost on CIFAR10, ImageNet-32, and ImageNet and also prove that it reduces the generalization error of any interpolating solution with high probability. By extending our analysis of transfer learning in linear regression, we present an approach for transfer learning in kernel regression. Namely, we demonstrate that transfer learning corresponds to adding a function to the minimum norm solution that produces zero error on the training data. We use this approach to perform transfer learning on image classification tasks using the neural tangent kernel.
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright MIT
dc.rights.uri	http://rightsstatements.org/page/InC-EDU/1.0/
dc.title	Beneficial Initializations in Over-Parameterized Machine Learning Problems
dc.type	Thesis
dc.description.degree	M.Eng.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree	Master
thesis.degree.name	Master of Engineering in Electrical Engineering and Computer Science
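
The abstract's core observation, that transfer learning in over-parameterized linear regression amounts to solving regression from a non-zero (pretrained) initialization rather than from zero, can be illustrated with a minimal sketch. The following is an illustrative NumPy example, not code from the thesis or from LLBoost; the synthetic data and the variable names (X, y, w0) are hypothetical.

# Illustrative sketch only (not from the thesis): minimum-norm interpolation
# from a non-zero initialization in over-parameterized linear regression.
import numpy as np

rng = np.random.default_rng(0)
n, d = 20, 100                       # over-parameterized: d >> n
X = rng.normal(size=(n, d))          # synthetic training inputs (hypothetical)
y = rng.normal(size=n)               # synthetic training targets (hypothetical)
w0 = rng.normal(size=d)              # stand-in for a pretrained initialization

# Standard minimum-norm interpolator (initialization at zero): w = X^+ y
w_min_norm = np.linalg.pinv(X) @ y

# Interpolator closest to w0: w = w0 + X^+ (y - X w0),
# i.e., regression solved "from a non-zero initialization".
w_transfer = w0 + np.linalg.pinv(X) @ (y - X @ w0)

# Both solutions fit the training data exactly; they differ only by a
# component in the null space of X, which is where a good initialization
# (transfer from a source task) can matter at test time.
assert np.allclose(X @ w_min_norm, y)
assert np.allclose(X @ w_transfer, y)

The kernel-regression extension described in the abstract follows the same pattern: the zero-training-error constraint pins down the solution on the training data, and the transferred component supplies what is otherwise left unconstrained, stated there as adding a function to the minimum-norm solution that produces zero error on the training data.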

