Notice

This is not the latest version of this item. The latest version can be found at: https://dspace.mit.edu/handle/1721.1/137379.2

Simple item record

dc.contributor.author: Lee, JKW
dc.contributor.author: Sattigeri, P
dc.contributor.author: Wornell, GW
dc.date.accessioned: 2021-11-04T17:21:35Z
dc.date.available: 2021-11-04T17:21:35Z
dc.date.issued: 2019-12
dc.identifier.uri: https://hdl.handle.net/1721.1/137379
dc.description.abstract: © 2019 Neural information processing systems foundation. All rights reserved. The advent of deep learning algorithms for mobile devices and sensors has led to a dramatic expansion in the availability and number of systems trained on a wide range of machine learning tasks, creating a host of opportunities and challenges in the realm of transfer learning. Currently, most transfer learning methods require some kind of control over the systems learned, either by enforcing constraints during the source training, or through the use of a joint optimization objective between tasks that requires all data be co-located for training. However, for practical, privacy, or other reasons, in a variety of applications we may have no control over the individual source task training, nor access to source training samples. Instead we only have access to features pre-trained on such data as the output of “black-boxes.” For such scenarios, we consider the multi-source learning problem of training a classifier using an ensemble of pre-trained neural networks for a set of classes that have not been observed by any of the source networks, and for which we have very few training samples. We show that by using these distributed networks as feature extractors, we can train an effective classifier in a computationally-efficient manner using tools from (nonlinear) maximal correlation analysis. In particular, we develop a method we refer to as maximal correlation weighting (MCW) to build the required target classifier from an appropriate weighting of the feature functions from the source networks. We illustrate the effectiveness of the resulting classifier on datasets derived from the CIFAR-100, Stanford Dogs, and Tiny ImageNet datasets, and, in addition, use the methodology to characterize the relative value of different source tasks in learning a target task. [en_US]
dc.language.iso: en
dc.relation.isversionof: https://papers.nips.cc/paper/2019/hash/6048ff4e8cb07aa60b6777b6f7384d52-Abstract.html [en_US]
dc.rights: Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. [en_US]
dc.source: Neural Information Processing Systems (NIPS) [en_US]
dc.title: Learning new tricks from old dogs: Multi-source transfer learning from pre-trained networks [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Lee, JKW, Sattigeri, P and Wornell, GW. 2019. "Learning new tricks from old dogs: Multi-source transfer learning from pre-trained networks." Advances in Neural Information Processing Systems, 32.
dc.relation.journal: Advances in Neural Information Processing Systems [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dc.date.updated: 2021-02-03T17:11:48Z
dspace.orderedauthors: Lee, JKW; Sattigeri, P; Wornell, GW [en_US]
dspace.date.submission: 2021-02-03T17:11:50Z
mit.journal.volume: 32 [en_US]
mit.license: PUBLISHER_POLICY
mit.metadata.status: Authority Work and Publication Information Needed [en_US]
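
The abstract describes the maximal correlation weighting (MCW) approach at a high level: each pre-trained source network is used as a black-box feature extractor, each feature's correlation with the few available target labels is estimated, and the correlation-weighted features are combined to score the new classes. The NumPy sketch below illustrates that general recipe under several assumptions; the names (fit_mcw, predict_mcw, source_extractors) and the particular normalization and weighting choices are illustrative and should not be read as the authors' implementation.

import numpy as np

def fit_mcw(source_extractors, X_few, y_few):
    # For each black-box source network, estimate a per-class label embedding G
    # and per-feature correlation weights sigma from the few labelled target
    # samples. All modelling choices here are assumptions for the sketch.
    y_few = np.asarray(y_few)
    classes = np.unique(y_few)
    params = []
    for extract in source_extractors:
        F_raw = extract(X_few)                     # (n, d) features from one source network
        mu = F_raw.mean(axis=0)
        sd = F_raw.std(axis=0) + 1e-8
        F = (F_raw - mu) / sd                      # standardized feature functions f(x)
        # Label embedding g(y): class-conditional mean of each feature.
        G = np.stack([F[y_few == c].mean(axis=0) for c in classes])   # (C, d)
        # Correlation weight per feature: empirical estimate of E[f(x) g(y)].
        idx = np.searchsorted(classes, y_few)
        sigma = (F * G[idx]).mean(axis=0)          # (d,)
        params.append((mu, sd, G, sigma))
    return classes, params

def predict_mcw(source_extractors, classes, params, X_test):
    # Score each candidate class by the sigma-weighted inner product between
    # the test features and the class embedding, summed over source networks.
    scores = np.zeros((len(X_test), len(classes)))
    for extract, (mu, sd, G, sigma) in zip(source_extractors, params):
        F = (extract(X_test) - mu) / sd
        scores += (F * sigma) @ G.T                # (n_test, C)
    return classes[scores.argmax(axis=1)]

A hypothetical usage would be classes, params = fit_mcw(extractors, X_few, y_few) followed by preds = predict_mcw(extractors, classes, params, X_test), where each element of extractors maps a batch of inputs to a (n_samples, n_features) feature matrix produced by one pre-trained source network.
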

