LLMR: Real-time Prompting of Interactive Worlds using Large Language Models

De La Torre, Fernanda; Fang, Cathy Mengying; Huang, Han; Banburski-Fahey, Andrzej; Amores Fernandez, Judith; Lanier, Jaron

dc.contributor.author	De La Torre, Fernanda
dc.contributor.author	Fang, Cathy Mengying
dc.contributor.author	Huang, Han
dc.contributor.author	Banburski-Fahey, Andrzej
dc.contributor.author	Amores Fernandez, Judith
dc.contributor.author	Lanier, Jaron
dc.date.accessioned	2024-06-04T19:27:28Z
dc.date.available	2024-06-04T19:27:28Z
dc.date.issued	2024-05-11
dc.identifier.isbn	979-8-4007-0330-0
dc.identifier.uri	https://hdl.handle.net/1721.1/155184
dc.description.abstract	We present Large Language Model for Mixed Reality (LLMR), a framework for the real-time creation and modification of interactive Mixed Reality experiences using LLMs. LLMR leverages novel strategies to tackle difficult cases where ideal training data is scarce, or where the design goal requires the synthesis of internal dynamics, intuitive analysis, or advanced interactivity. Our framework relies on text interaction and the Unity game engine. By incorporating techniques for scene understanding, task planning, self-debugging, and memory management, LLMR achieves a 4x lower average error rate than standard GPT-4. We demonstrate LLMR’s cross-platform interoperability with several example worlds, and evaluate it on a variety of creation and modification tasks to show that it can produce and edit diverse objects, tools, and scenes. Finally, we conducted a usability study (N=11) with a diverse set of participants, which revealed that they had positive experiences with the system and would use it again.	en_US
dc.publisher	ACM	en_US
dc.relation.isversionof	10.1145/3613904.3642579	en_US
dc.rights	Creative Commons Attribution	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en_US
dc.source	Association for Computing Machinery	en_US
dc.title	LLMR: Real-time Prompting of Interactive Worlds using Large Language Models	en_US
dc.type	Article	en_US
dc.identifier.citation	De La Torre, Fernanda, Fang, Cathy Mengying, Huang, Han, Banburski-Fahey, Andrzej, Amores Fernandez, Judith et al. 2024. "LLMR: Real-time Prompting of Interactive Worlds using Large Language Models."
dc.contributor.department	Massachusetts Institute of Technology. Media Laboratory
dc.contributor.department	Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences
dc.identifier.mitlicense	PUBLISHER_CC
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2024-06-01T07:52:19Z
dc.language.rfc3066	en
dc.rights.holder	The author(s)
dspace.date.submission	2024-06-01T07:52:19Z
mit.license	PUBLISHER_CC
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name: license_rdf
Size: 40 bytes
Format: application/rdf+xml

Name: 3613904.3642579.pdf
Size: 17.08Mb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles
