Show simple item record

dc.contributor.author: Mhaskar, Hrushikesh
dc.contributor.author: Poggio, Tomaso
dc.date.accessioned: 2016-08-12T22:44:41Z
dc.date.available: 2016-08-12T22:44:41Z
dc.date.issued: 2016-08-12
dc.identifier.uri: http://hdl.handle.net/1721.1/103911
dc.description.abstract: The paper briefly reviews several recent results on hierarchical architectures for learning from examples that may formally explain the conditions under which Deep Convolutional Neural Networks perform much better in function approximation problems than shallow, one-hidden-layer architectures. The paper announces new results for a non-smooth activation function, the ReLU function, used in present-day neural networks, as well as for Gaussian networks. We propose a new definition of relative dimension to encapsulate different notions of sparsity of a function class that can possibly be exploited by deep networks, but not by shallow ones, to drastically reduce the complexity required for approximation and learning.
dc.description.sponsorship: This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF-1231216.
dc.language.iso: en_US
dc.publisher: Center for Brains, Minds and Machines (CBMM), arXiv
dc.relation.ispartofseries: CBMM Memo Series;054
dc.rights: Attribution-NonCommercial-ShareAlike 3.0 United States
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/3.0/us/
dc.subject: hierarchical architectures
dc.subject: Deep Convolutional Neural Networks
dc.subject: ReLU function
dc.subject: Gaussian networks
dc.title: Deep vs. shallow networks: An approximation theory perspective
dc.type: Technical Report
dc.type: Working Paper
dc.type: Other
dc.identifier.citation: arXiv:1608.03287
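As a rough illustration of the architectural contrast described in the abstract, the following is a minimal, untrained Python/NumPy sketch (not the paper's construction) comparing a shallow one-hidden-layer ReLU network with a hierarchical ReLU network whose modules mirror a binary-tree compositional structure. The target function, layer widths, and module layout are hypothetical choices made only for illustration.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

# Toy compositional target on 4 inputs: f(x) = h3(h1(x1, x2), h2(x3, x4)).
# The constituent functions h1, h2, h3 are hypothetical, chosen only for illustration.
def target(x):
    h1 = np.sin(x[:, 0] * x[:, 1])
    h2 = np.cos(x[:, 2] + x[:, 3])
    return h1 * h2

def shallow_relu(x, W1, b1, w2):
    """Shallow network: one wide hidden ReLU layer over all 4 inputs at once."""
    return relu(x @ W1 + b1) @ w2

def deep_relu(x, leaf_modules, root_module):
    """Hierarchical network mirroring the binary tree of the target:
    each leaf module sees one pair of inputs, and the root combines the two."""
    pairs = [x[:, 0:2], x[:, 2:4]]
    mids = [relu(p @ W + b) @ w for p, (W, b, w) in zip(pairs, leaf_modules)]
    z = np.stack(mids, axis=1)          # (n_samples, 2) intermediate outputs
    W, b, w = root_module
    return relu(z @ W + b) @ w

rng = np.random.default_rng(0)

def module(n_in, n_hidden):
    """Random (untrained) one-hidden-layer ReLU module with scalar output."""
    return (rng.normal(size=(n_in, n_hidden)),
            rng.normal(size=n_hidden),
            rng.normal(size=n_hidden))

shallow_params = module(n_in=4, n_hidden=32)                 # one big module
deep_params = ([module(2, 8), module(2, 8)], module(2, 8))   # three small modules

x = rng.normal(size=(5, 4))
print("target              :", target(x))
print("shallow (untrained) :", shallow_relu(x, *shallow_params))
print("deep    (untrained) :", deep_relu(x, *deep_params))
```

The point of the sketch is purely structural: each module in the deep variant depends on only two inputs at a time, which is the kind of compositional sparsity the memo argues deep networks can exploit and shallow ones cannot.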


Files in this item


There are no files associated with this item.

