Show simple item record

dc.contributor.authorBrade, Stephen
dc.contributor.authorAnderson, Sam
dc.contributor.authorKumar, Rithesh
dc.contributor.authorJin, Zeyu
dc.contributor.authorTruong, Anh
dc.date.accessioned2025-09-19T17:51:07Z
dc.date.available2025-09-19T17:51:07Z
dc.date.issued2025-04-25
dc.identifier.isbn979-8-4007-1394-1
dc.identifier.urihttps://hdl.handle.net/1721.1/162765
dc.descriptionCHI ’25, Yokohama, Japanen_US
dc.description.abstractNovice content creators often invest significant time recording expressive speech for social media videos. While recent advancements in text-to-speech (TTS) technology can generate highly realistic speech in various languages and accents, many struggle with unintuitive or overly granular TTS interfaces. We propose simplifying TTS generation by allowing users to specify high-level context alongside their script. Our Wizard-of-Oz system, SpeakEasy, leverages user-provided context to inform and influence TTS output, enabling iterative refinement with high-level feedback. This approach was informed by two 8-subject formative studies: one examining content creators’ experiences with TTS, and the other drawing on effective strategies from voice actors. Our evaluation shows that participants using SpeakEasy were more successful in generating performances matching their personal standards, without requiring significantly more effort than leading industry interfaces.en_US
dc.publisherACM|CHI Conference on Human Factors in Computing Systemsen_US
dc.relation.isversionofhttps://doi.org/10.1145/3706598.3714263en_US
dc.rightsCreative Commons Attribution-Noncommercial-ShareAlikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleSpeakEasy: Enhancing Text-to-Speech Interactions for Expressive Content Creationen_US
dc.typeArticleen_US
dc.identifier.citationStephen Brade, Sam Anderson, Rithesh Kumar, Zeyu Jin, and Anh Truong. 2025. SpeakEasy: Enhancing Text-to-Speech Interactions for Expressive Content Creation. In Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25). Association for Computing Machinery, New York, NY, USA, Article 756, 1–19.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.identifier.mitlicensePUBLISHER_POLICY
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2025-08-01T08:16:38Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2025-08-01T08:16:39Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record