Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation

Caren, Matthew; Chandra, Kartik; Tenenbaum, Joshua; Ragan-Kelley, Jonathan; Ma, Karima

Author(s)

Caren, Matthew; Chandra, Kartik; Tenenbaum, Joshua; Ragan-Kelley, Jonathan; Ma, Karima

Download3680528.3687679.pdf (3.274Mb)

Publisher with Creative Commons License

Terms of use

Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/

Metadata

Show full item record

Abstract

We present a method for automatically producing human-like vocal imitations of sounds: the equivalent of “sketching,” but for auditory rather than visual representation. Starting with a simulated model of the human vocal tract, we first try generating vocal imitations by tuning the model’s control parameters to make the synthesized vocalization match the target sound in terms of perceptually-salient auditory features. Then, to better match human intuitions, we apply a cognitive theory of communication to take into account how human speakers reason strategically about their listeners. Finally, we show through several experiments and user studies that when we add this type of communicative reasoning to our method, it aligns with human intuitions better than matching auditory features alone does. This observation has broad implications for the study of depiction in computer graphics.

Description

SA Conference Papers ’24, December 03–06, 2024, Tokyo, Japan

Date issued

2024-12-03

URI

https://hdl.handle.net/1721.1/158128

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences

Publisher

ACM|SIGGRAPH Asia 2024 Conference Papers

Citation

Caren, Matthew, Chandra, Kartik, Tenenbaum, Joshua, Ragan-Kelley, Jonathan and Ma, Karima. 2024. "Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation."

Version: Final published version

ISBN

979-8-4007-1131-2

Collections

MIT Open Access Articles

The following license files are associated with this item:

Creative Commons