The Augmented Geometrically Spaced Transform : applications of the single channel frequency estimator

Feldman, Jonathan Michael,S.M.Massachusetts Institute of Technology.

Author(s)

Feldman, Jonathan Michael,S.M.Massachusetts Institute of Technology.

Download1256659362-MIT.pdf (8.960Mb)

Alternative title

Applications of the single channel frequency estimator

Other Contributors

Program in Media Arts and Sciences (Massachusetts Institute of Technology)

Advisor

Joseph A. Paradiso.

Terms of use

MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582

Metadata

Show full item record

Abstract

The Augmented Geometrically Spaced Transform (AGST) is an auditory model that is based on an inversion of the acoustic piano, where the piano produces music and the transform analyses it. In contrast with the standard spectrogram, which is a complex frequency vector versus time, the AGST is based around a matrix of frequencies, known as the AGST Frequency Matrix, where for every frequency in the matrix, a spectral envelope is computed using a Single Channel Frequency Estimator (SCFE). The core invention of the thesis is the algorithm for the SCFE, which computes spectral envelopes with maximally high definition in a computationally efficient manner. A bank of SCFEs is assembled into a constant Q transform, known as a Geometrically Spaced Transform (GST). The GST can be used to visualize harmonics inside of musical notes, or audio in general, in a constant Q fashion. It is then shown that the AGST is a good front-end model for computational pitch perception. For example, it can be used to solve an important problem in auditory perception, the case of the missing fundamental. The entire thesis is framed in the context of building artificially intelligent music systems, including synthetic listeners (machines that listen in the way that people do), and synthetic performers (machines that allow for interactive music performance).

Description

Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, February, 2021

Cataloged from the official PDF version of thesis.

Includes bibliographical references (pages 99-103).

Date issued

2021

URI

https://hdl.handle.net/1721.1/131006

Department

Program in Media Arts and Sciences (Massachusetts Institute of Technology)

Publisher

Massachusetts Institute of Technology

Keywords

Program in Media Arts and Sciences

Collections

Graduate Theses