Multimodal Graphical User Interface for 3D Model
Fabrication Through Generative AI

Báez Alicea, Isabel

dc.contributor.advisor	Mueller, Stefanie
dc.contributor.author	Báez Alicea, Isabel
dc.date.accessioned	2025-04-14T14:05:05Z
dc.date.available	2025-04-14T14:05:05Z
dc.date.issued	2025-02
dc.date.submitted	2025-04-03T14:06:10.311Z
dc.identifier.uri	https://hdl.handle.net/1721.1/159092
dc.description.abstract	In recent years, three-dimensional model generation and manipulation through generative AI has seen significant developments. Current projects enable the generation of threedimensional assets from natural language prompts and input images, as well as functionalityaware model manipulation through mesh segmentation and categorization. However, all these workflows lack a coherent, unified platform that caters to users’ needs and each method’s technologies. Programs that rely on terminal-based commands lack the graphics needed for model interactions, and plugin extensions for 3D modeling applications are unintuitive and hard to extend for new functionalities. Additionally, both approaches require users to have prior computer engineering and/or 3D graphics knowledge. For this thesis, I propose the creation of a web-based, multimodal graphical user interface that consolidates all these different technologies in a single platform. By supporting model stylization and model generation (both from text prompts and input images), users can utilize combined workflows and expand the range of output possibilities for 3D asset creation. Other features in our interface include model uploading, saving, and downloading to enable a continuous stream of work on a single 3D asset. Apart from all this, we expand the current capabilities of existing image-to-3D generation programs by enabling users to combine up to six images together and create a merged 3D object. Each of these images corresponds to a view angle from which the outputted mesh will be built.
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright retained by author(s)
dc.rights.uri	https://rightsstatements.org/page/InC-EDU/1.0/
dc.title	Multimodal Graphical User Interface for 3D Model Fabrication Through Generative AI
dc.type	Thesis
dc.description.degree	M.Eng.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree	Master
thesis.degree.name	Master of Engineering in Electrical Engineering and Computer Science

Files in this item

Name:: baez-ibaez-meng-eecs-2025-thes ...
Size:: 26.14Mb
Format:: PDF
Description:: Thesis PDF

View/Open

This item appears in the following Collection(s)

Graduate Theses

Show simple item record

Multimodal Graphical User Interface for 3D Model Fabrication Through Generative AI

Files in this item

This item appears in the following Collection(s)