Show simple item record

dc.contributor.advisor: Agrawal, Pulkit
dc.contributor.author: Abhangi, Nishant
dc.date.accessioned: 2024-10-09T18:31:30Z
dc.date.available: 2024-10-09T18:31:30Z
dc.date.issued: 2024-09
dc.date.submitted: 2024-10-07T14:34:37.574Z
dc.identifier.uri: https://hdl.handle.net/1721.1/157255
dc.description.abstract: Natural language instruction-following capabilities are important for robots that must carry out tasks specified by human commands. Accordingly, many language-conditioned robots have been trained on a wide variety of datasets with tasks annotated by natural language instructions. However, these datasets are often limited in size, so the distribution and nature of instructions given by real-world users may differ from those in the datasets, making it unclear how these robots will perform in real-world environments. A large-scale evaluation with diverse instructions is therefore needed to benchmark their performance, but using humans to collect more annotations is prohibitively expensive. We show that recent large language models (LLMs) provide a scalable and inexpensive way to perform such an evaluation, and that robot performance drops substantially when evaluated on this larger set of instructions. We also show that different prompts to the LLM can control properties such as the diversity of the generated instructions.
dc.publisher: Massachusetts Institute of Technology
dc.rights: In Copyright - Educational Use Permitted
dc.rights: Copyright retained by author(s)
dc.rights.uri: https://rightsstatements.org/page/InC-EDU/1.0/
dc.title: Red Teaming Language Conditioned Robotic Behavior
dc.type: Thesis
dc.description.degree: M.Eng.
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree: Master
thesis.degree.name: Master of Engineering in Electrical Engineering and Computer Science

