Large Language Model Routing with Benchmark Datasets

Ou, Anthony C.

dc.contributor.advisor	Thompson, Neil
dc.contributor.author	Ou, Anthony C.
dc.date.accessioned	2024-03-21T19:10:03Z
dc.date.available	2024-03-21T19:10:03Z
dc.date.issued	2024-02
dc.date.submitted	2024-03-04T16:38:12.047Z
dc.identifier.uri	https://hdl.handle.net/1721.1/153846
dc.description.abstract	The number of open-source Large Language Models (LLMs), and of benchmark datasets for comparing them, is growing rapidly. While some models dominate these benchmarks, typically no single model achieves the best accuracy across all tasks and use cases. Given a new dataset, it can be difficult to determine which LLM is best suited to the task. This work addresses the challenges of selecting the best LLM from a collection for a new task. To do so, benchmark datasets are repurposed to learn a “router” model for this LLM selection, such that the router solves a collection of binary classification tasks. This work demonstrates the utility and limitations of learning model routers from various benchmark datasets, where routing improves upon the performance of using any single model for all tasks.
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright retained by author(s)
dc.rights.uri	https://rightsstatements.org/page/InC-EDU/1.0/
dc.title	Large Language Model Routing with Benchmark Datasets
dc.type	Thesis
dc.description.degree	M.Eng.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree	Master
thesis.degree.name	Master of Engineering in Electrical Engineering and Computer Science

Files in this item

Name: ou-aou-meng-eecs-2024-thesis.pdf
Size: 2.330Mb
Format: PDF
Description: Thesis PDF

This item appears in the following Collection(s)

Graduate Theses
