MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

The Augmented Geometrically Spaced Transform : applications of the single channel frequency estimator

Author(s)
Feldman, Jonathan Michael,S.M.Massachusetts Institute of Technology.
Thumbnail
Download1256659362-MIT.pdf (8.960Mb)
Alternative title
Applications of the single channel frequency estimator
Other Contributors
Program in Media Arts and Sciences (Massachusetts Institute of Technology)
Advisor
Joseph A. Paradiso.
Terms of use
MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
The Augmented Geometrically Spaced Transform (AGST) is an auditory model that is based on an inversion of the acoustic piano, where the piano produces music and the transform analyses it. In contrast with the standard spectrogram, which is a complex frequency vector versus time, the AGST is based around a matrix of frequencies, known as the AGST Frequency Matrix, where for every frequency in the matrix, a spectral envelope is computed using a Single Channel Frequency Estimator (SCFE). The core invention of the thesis is the algorithm for the SCFE, which computes spectral envelopes with maximally high definition in a computationally efficient manner. A bank of SCFEs is assembled into a constant Q transform, known as a Geometrically Spaced Transform (GST). The GST can be used to visualize harmonics inside of musical notes, or audio in general, in a constant Q fashion. It is then shown that the AGST is a good front-end model for computational pitch perception. For example, it can be used to solve an important problem in auditory perception, the case of the missing fundamental. The entire thesis is framed in the context of building artificially intelligent music systems, including synthetic listeners (machines that listen in the way that people do), and synthetic performers (machines that allow for interactive music performance).
Description
Thesis: S.M., Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, February, 2021
 
Cataloged from the official PDF version of thesis.
 
Includes bibliographical references (pages 99-103).
 
Date issued
2021
URI
https://hdl.handle.net/1721.1/131006
Department
Program in Media Arts and Sciences (Massachusetts Institute of Technology)
Publisher
Massachusetts Institute of Technology
Keywords
Program in Media Arts and Sciences

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.