Show simple item record

dc.contributor.authorDe La Torre, Fernanda
dc.contributor.authorFang, Cathy Mengying
dc.contributor.authorHuang, Han
dc.contributor.authorBanburski-Fahey, Andrzej
dc.contributor.authorAmores Fernandez, Judith
dc.contributor.authorLanier, Jaron
dc.date.accessioned2024-06-04T19:27:28Z
dc.date.available2024-06-04T19:27:28Z
dc.date.issued2024-05-11
dc.identifier.isbn979-8-4007-0330-0
dc.identifier.urihttps://hdl.handle.net/1721.1/155184
dc.description.abstractWe present Large Language Model for Mixed Reality (LLMR), a framework for the real-time creation and modification of interactive Mixed Reality experiences using LLMs. LLMR leverages novel strategies to tackle difficult cases where ideal training data is scarce, or where the design goal requires the synthesis of internal dynamics, intuitive analysis, or advanced interactivity. Our framework relies on text interaction and the Unity game engine. By incorporating techniques for scene understanding, task planning, self-debugging, and memory management, LLMR outperforms the standard GPT-4 by 4x in average error rate. We demonstrate LLMR’s cross-platform interoperability with several example worlds, and evaluate it on a variety of creation and modification tasks to show that it can produce and edit diverse objects, tools, and scenes. Finally, we conducted a usability study (N=11) with a diverse set that revealed participants had positive experiences with the system and would use it again.en_US
dc.publisherACMen_US
dc.relation.isversionof10.1145/3613904.3642579en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAssociation for Computing Machineryen_US
dc.titleLLMR: Real-time Prompting of Interactive Worlds using Large Language Modelsen_US
dc.typeArticleen_US
dc.identifier.citationDe La Torre, Fernanda, Fang, Cathy Mengying, Huang, Han, Banburski-Fahey, Andrzej, Amores Fernandez, Judith et al. 2024. "LLMR: Real-time Prompting of Interactive Worlds using Large Language Models."
dc.contributor.departmentMassachusetts Institute of Technology. Media Laboratory
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciences
dc.identifier.mitlicensePUBLISHER_CC
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2024-06-01T07:52:19Z
dc.language.rfc3066en
dc.rights.holderThe author(s)
dspace.date.submission2024-06-01T07:52:19Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record