
dc.contributor.author: Caren, Matthew
dc.contributor.author: Chandra, Kartik
dc.contributor.author: Tenenbaum, Joshua
dc.contributor.author: Ragan-Kelley, Jonathan
dc.contributor.author: Ma, Karima
dc.date.accessioned: 2025-01-29T19:32:54Z
dc.date.available: 2025-01-29T19:32:54Z
dc.date.issued: 2024-12-03
dc.identifier.isbn: 979-8-4007-1131-2
dc.identifier.uri: https://hdl.handle.net/1721.1/158128
dc.description: SA Conference Papers ’24, December 03–06, 2024, Tokyo, Japan [en_US]
dc.description.abstract: We present a method for automatically producing human-like vocal imitations of sounds: the equivalent of “sketching,” but for auditory rather than visual representation. Starting with a simulated model of the human vocal tract, we first try generating vocal imitations by tuning the model’s control parameters to make the synthesized vocalization match the target sound in terms of perceptually-salient auditory features. Then, to better match human intuitions, we apply a cognitive theory of communication to take into account how human speakers reason strategically about their listeners. Finally, we show through several experiments and user studies that when we add this type of communicative reasoning to our method, it aligns with human intuitions better than matching auditory features alone does. This observation has broad implications for the study of depiction in computer graphics. [en_US]
dc.publisher: ACM|SIGGRAPH Asia 2024 Conference Papers [en_US]
dc.relation.isversionof: https://doi.org/10.1145/3680528.3687679 [en_US]
dc.rights: Creative Commons Attribution [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ [en_US]
dc.source: Association for Computing Machinery [en_US]
dc.title: Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Caren, Matthew, Chandra, Kartik, Tenenbaum, Joshua, Ragan-Kelley, Jonathan and Ma, Karima. 2024. "Sketching With Your Voice: "Non-Phonorealistic" Rendering of Sounds via Vocal Imitation."
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences [en_US]
dc.identifier.mitlicense: PUBLISHER_CC
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dc.date.updated: 2025-01-01T08:51:10Z
dc.language.rfc3066: en
dc.rights.holder: The author(s)
dspace.date.submission: 2025-01-01T08:51:11Z
mit.license: PUBLISHER_CC
mit.metadata.status: Authority Work and Publication Information Needed [en_US]

