Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation

Caren, Matthew; Chandra, Kartik; Tenenbaum, Joshua; Ragan-Kelley, Jonathan; Ma, Karima

dc.contributor.author	Caren, Matthew
dc.contributor.author	Chandra, Kartik
dc.contributor.author	Tenenbaum, Joshua
dc.contributor.author	Ragan-Kelley, Jonathan
dc.contributor.author	Ma, Karima
dc.date.accessioned	2025-01-29T19:32:54Z
dc.date.available	2025-01-29T19:32:54Z
dc.date.issued	2024-12-03
dc.identifier.isbn	979-8-4007-1131-2
dc.identifier.uri	https://hdl.handle.net/1721.1/158128
dc.description	SA Conference Papers ’24, December 03–06, 2024, Tokyo, Japan	en_US
dc.description.abstract	We present a method for automatically producing human-like vocal imitations of sounds: the equivalent of “sketching,” but for auditory rather than visual representation. Starting with a simulated model of the human vocal tract, we first try generating vocal imitations by tuning the model’s control parameters to make the synthesized vocalization match the target sound in terms of perceptually-salient auditory features. Then, to better match human intuitions, we apply a cognitive theory of communication to take into account how human speakers reason strategically about their listeners. Finally, we show through several experiments and user studies that when we add this type of communicative reasoning to our method, it aligns with human intuitions better than matching auditory features alone does. This observation has broad implications for the study of depiction in computer graphics.	en_US
dc.publisher	ACM|SIGGRAPH Asia 2024 Conference Papers	en_US
dc.relation.isversionof	https://doi.org/10.1145/3680528.3687679	en_US
dc.rights	Creative Commons Attribution	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en_US
dc.source	Association for Computing Machinery	en_US
dc.title	Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation	en_US
dc.type	Article	en_US
dc.identifier.citation	Caren, Matthew, Chandra, Kartik, Tenenbaum, Joshua, Ragan-Kelley, Jonathan and Ma, Karima. 2024. "Sketching With Your Voice: 'Non-Phonorealistic' Rendering of Sounds via Vocal Imitation."
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences	en_US
dc.identifier.mitlicense	PUBLISHER_CC
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2025-01-01T08:51:10Z
dc.language.rfc3066	en
dc.rights.holder	The author(s)
dspace.date.submission	2025-01-01T08:51:11Z
mit.license	PUBLISHER_CC
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name: license_rdf
Size: 40 bytes
Format: application/rdf+xml

Name: 3680528.3687679.pdf
Size: 3.274Mb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles