I-theory on depth vs width: hierarchical function composition

Poggio, Tomaso; Anselmi, Fabio; Rosasco, Lorenzo

dc.contributor.author	Poggio, Tomaso
dc.contributor.author	Anselmi, Fabio
dc.contributor.author	Rosasco, Lorenzo
dc.date.accessioned	2015-12-30T02:37:36Z
dc.date.available	2015-12-30T02:37:36Z
dc.date.issued	2015-12-29
dc.identifier.uri	http://hdl.handle.net/1721.1/100559
dc.description.abstract	Deep learning networks with convolution, pooling and subsampling are a special case of hierar- chical architectures, which can be represented by trees (such as binary trees). Hierarchical as well as shallow networks can approximate functions of several variables, in particular those that are com- positions of low dimensional functions. We show that the power of a deep network architecture with respect to a shallow network is rather independent of the specific nonlinear operations in the network and depends instead on the the behavior of the VC-dimension. A shallow network can approximate compositional functions with the same error of a deep network but at the cost of a VC-dimension that is exponential instead than quadratic in the dimensionality of the function. To complete the argument we argue that there exist visual computations that are intrinsically compositional. In particular, we prove that recognition invariant to translation cannot be computed by shallow networks in the presence of clutter. Finally, a general framework that includes the compositional case is sketched. The key con- dition that allows tall, thin networks to be nicer that short, fat networks is that the target input-output function must be sparse in a certain technical sense.	en_US
dc.description.sponsorship	This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF - 1231216.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Center for Brains, Minds and Machines (CBMM)	en_US
dc.relation.ispartofseries	CBMM Memo Series;041
dc.rights	Attribution-NonCommercial 3.0 United States	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc/3.0/us/	*
dc.subject	Deep Convolutional Learning Networks (DCLNs)	en_US
dc.subject	Hierarchy	en_US
dc.subject	i-theory	en_US
dc.title	I-theory on depth vs width: hierarchical function composition	en_US
dc.type	Technical Report	en_US
dc.type	Working Paper	en_US
dc.type	Other	en_US

Files in this item

Name:: CBMM-Memo-041.pdf
Size:: 1.178Mb
Format:: PDF

View/Open

Name:: license_rdf
Size:: 1.346Kb
Format:: application/rdf+xml

View/Open

This item appears in the following Collection(s)

CBMM Memo Series

Show simple item record