Deep MinMax Networks
Author(s)
Lohmiller, Winfried; Gassert, Philipp; Slotine, Jean-Jacques
DownloadAccepted version (1.133Mb)
Open Access Policy
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
While much progress has been achieved over the last decades in neuro-inspired
machine learning, there are still fundamental theoretical problems in gradient-based
learning using combinations of neurons. These problems, such as saddle points and
suboptimal plateaus of the cost function, can lead in theory and practice to failures
of learning. In addition, the discrete step size selection of the gradient is problematic
since too large steps can lead to instability and too small steps slow down the learning.
This paper describes an alternative discrete MinMax learning approach for continuous piece-wise linear functions. Global exponential convergence of the algorithm
is established using Contraction Theory with Inequality Constraints [6], which is extended from the continuous to the discrete case in this paper:
• The parametrization of each linear function piece is, in contrast to deep learning, linear in the proposed MinMax network. This allows a linear regression
stability proof as long as measurements do not transit from one linear region to
its neighbouring linear region.
• The step size of the discrete gradient descent is Lagrangian limited orthogonal
to the edge of two neighbouring linear functions. It will be shown that this Lagrangian step limitation does not decrease the convergence of the unconstrained
system dynamics in contrast to a step size limitation in the direction of the gradient.
We show that the convergence rate of a constrained piece-wise linear function learning is equivalent to the exponential convergence rates of the individual local linear
regions.
Description
2021 60th IEEE Conference on Decision and Control (CDC) December 13-15, 2021. Austin, Texas
Date issued
2021-12-14Department
Massachusetts Institute of Technology. Department of Mechanical Engineering; Massachusetts Institute of Technology. Nonlinear Systems LaboratoryPublisher
IEEE|2021 60th IEEE Conference on Decision and Control (CDC)
Citation
Lohmiller, Winfried, Gassert, Philipp and Slotine, Jean-Jacques. 2021. "Deep MinMax Networks."
Version: Author's final manuscript