Neural Networks on Eigenvector Data
Author(s)
Lim, Derek
Advisor
Jegelka, Stefanie
Abstract
The need to process eigenvectors derived from data arises across numerous domains in computing and the sciences. However, eigenvectors differ from other types of data in that they have particular symmetries: for any eigenvector of a matrix, the negation of that vector is also an eigenvector with the same eigenvalue, giving a sign symmetry, and higher-dimensional eigenspaces carry more general continuous basis symmetries. In this thesis, we present the first neural networks that process eigenvector input while respecting these symmetries. We build neural networks that are invariant to sign and basis symmetries, as well as neural networks that are equivariant to sign symmetries. Under certain conditions, these networks are provably universal: they can approximate any continuous function with the desired invariances. When used with Laplacian eigenvectors, our invariant neural networks are provably powerful for graph representation learning, as they can approximate several classes of important functions on graphs. Empirically, our networks improve machine learning models that use eigenvectors, in tasks including molecular graph regression, learning expressive graph representations, and learning neural fields on triangle meshes.
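The sign-invariance idea described in the abstract admits a compact illustration. The sketch below is not the thesis code; it mirrors the φ/ρ decomposition of the SignNet-style construction the thesis builds on, and the class name, layer sizes, and dimensions are illustrative assumptions. It builds a network whose output is unchanged when an eigenvector is negated.

```python
# Minimal sketch (illustrative, not the thesis implementation) of a
# sign-invariant network: because v and -v are equally valid
# eigenvectors, a network f should satisfy f(v) == f(-v). Summing an
# unconstrained MLP phi over both signs enforces this by construction.
import torch
import torch.nn as nn

class SignInvariantNet(nn.Module):
    def __init__(self, dim: int, hidden: int = 64, out: int = 32):
        super().__init__()
        # phi processes a single eigenvector; rho maps the symmetrized
        # features to the final output. All sizes here are assumptions.
        self.phi = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, hidden)
        )
        self.rho = nn.Sequential(
            nn.Linear(hidden, hidden), nn.ReLU(), nn.Linear(hidden, out)
        )

    def forward(self, v: torch.Tensor) -> torch.Tensor:
        # phi(v) + phi(-v) is unchanged when v is negated, so the
        # output inherits sign invariance exactly, by construction.
        return self.rho(self.phi(v) + self.phi(-v))

net = SignInvariantNet(dim=16)
v = torch.randn(8, 16)  # e.g., a batch of 8 eigenvectors of length 16
assert torch.allclose(net(v), net(-v))  # invariance holds exactly
```

Because the sum φ(v) + φ(-v) is symmetric under negation of v, the invariance is guaranteed architecturally rather than learned from data; the basis-invariant case in higher-dimensional eigenspaces requires a more involved construction.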
Date issued
2023-06
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology