Recently added

On the Power of Decision Trees in Auto-Regressive Language Modeling

Gan, Yulu; Galanti, Tomer; Poggio, Tomaso; Malach, Eran (Center for Brains, Minds and Machines (CBMM), 2024-09-27)

Originally proposed for handling time series data, Auto-regressive Decision Trees (ARDTs) have not yet been explored for language modeling. This paper delves into both the theoretical and practical applications of ARDTs ...

For HyperBFs AGOP is a greedy approximation to gradient descent

Gan, Yulu; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), 2024-07-13)

The Average Gradient Outer Product (AGOP) provides a novel approach to feature learning in neural networks. We applied both AGOP and Gradient Descent to learn the matrix M in the Hyper Basis Function Network (HyperBF) and ...

Compositional Sparsity of Learnable Functions

Poggio, Tomaso; Fraser, Maia (Center for Brains, Minds and Machines (CBMM), 2024-02-08)

Neural networks have demonstrated impressive success in various domains, raising the question of what fundamental principles underlie the effectiveness of the best AI systems and quite possibly of human intelligence. This ...

DSpace@MIT

CBMM Memo Series: Recent submissions

On the Power of Decision Trees in Auto-Regressive Language Modeling

For HyperBFs AGOP is a greedy approximation to gradient descent

Compositional Sparsity of Learnable Functions

CBMM Memo Series: Recent submissions

On the Power of Decision Trees in Auto-Regressive Language Modeling ﻿

For HyperBFs AGOP is a greedy approximation to gradient descent ﻿

Compositional Sparsity of Learnable Functions ﻿

On the Power of Decision Trees in Auto-Regressive Language Modeling

For HyperBFs AGOP is a greedy approximation to gradient descent

Compositional Sparsity of Learnable Functions