Show simple item record

dc.contributor.advisorSolar-Lezama, Armando
dc.contributor.authorBowers, Matthew L.
dc.date.accessioned2023-07-31T19:35:02Z
dc.date.available2023-07-31T19:35:02Z
dc.date.issued2023-06
dc.date.submitted2023-07-13T14:16:20.803Z
dc.identifier.urihttps://hdl.handle.net/1721.1/151374
dc.description.abstractThis thesis introduces corpus-guided top-down synthesis as a mechanism for synthesizing library functions that capture common functionality from a corpus of programs in a domain specific language (DSL). The algorithm builds abstractions directly from initial DSL primitives, using syntactic pattern matching of intermediate abstractions to intelligently prune the search space and guide the algorithm towards abstractions that maximally capture shared structures in the corpus. We present an implementation of the approach in a tool called Stitch and evaluate it against the state-of-the-art deductive library learning algorithm from DreamCoder. Our evaluation shows that Stitch is 3-4 orders of magnitude faster and uses 2 orders of magnitude less memory while maintaining comparable or better library quality (as measured by compressivity). We also demonstrate Stitch’s scalability on corpora containing hundreds of complex programs that are intractable with prior deductive approaches and show empirically that it is robust to terminating the search procedure early—further allowing it to scale to challenging datasets by means of early stopping. We publish the code, the documentation, a tutorial, and a Python library for interfacing with our for our Rust implementation of Stitch. Tutorial & Documentation (Python Library): https://stitch-bindings.read thedocs.io/en/stable/intro/tutorial.html Rust Implementation: https://github.com/mlb2251/stitch Artifact (Awarded: Reusable): https://github.com/mlb2251/stitch-artifact
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://rightsstatements.org/page/InC-EDU/1.0/
dc.titleTop-Down Synthesis for Library Learning
dc.typeThesis
dc.description.degreeS.M.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.orcid0000-0001-8450-7033
mit.thesis.degreeMaster
thesis.degree.nameMaster of Science in Electrical Engineering and Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record