Bayesian Active Structure Learning for Gaussian Process Probabilistic Programs

Lin, Gloria Z.

dc.contributor.advisor	Mansinghka, Vikash
dc.contributor.advisor	Zhi-Xuan, Tan
dc.contributor.author	Lin, Gloria Z.
dc.date.accessioned	2022-06-15T13:01:33Z
dc.date.available	2022-06-15T13:01:33Z
dc.date.issued	2022-02
dc.date.submitted	2022-02-22T18:32:04.574Z
dc.identifier.uri	https://hdl.handle.net/1721.1/143176
dc.description.abstract	What data should we gather to learn about the underlying structure of the world as quickly as possible, especially in cases where data is sparse or expensive to acquire? Structure learning techniques for Gaussian process (GP) probabilistic programs provide a rich framework for inferring qualitative structure in data. In this thesis, we improve the data-efficiency of probabilistic GP structure learning by extending it to the active learning setting. We present a sequential Monte Carlo algorithm for Bayesian active learning for GPs with a novel objective function, Kernel Information Gain (IG-K), to reduce uncertainty over model structure and parameters. As a baseline for comparison, we also formulate a second objective function, Predictive Information Gain (IG-P), that reduces uncertainty over the posterior predictive distribution. We empirically validate that active learning with our novel IG-K objective is able to more accurately infer the structure of synthetic datasets using fewer datapoints than active learning with IG-P. We also validate the underlying active learning inference algorithm using simulation-based calibration. Finally, we test our active learning algorithm on a real-world dataset with complex structure. Collectively, the results provide a deeper understanding of the benefits and limitations of active structure learning using Gaussian processes, revealing that an active selection strategy suited for inferring the model structure and parameters may not favorable for providing accurate predictions. These findings suggest directions for future active learning approaches which combine the IG-K and IG-P objectives, leveraging the advantages of each objective to efficiently discover structure in data and provide accurate predictions.
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright MIT
dc.rights.uri	http://rightsstatements.org/page/InC-EDU/1.0/
dc.title	Bayesian Active Structure Learning for Gaussian Process Probabilistic Programs
dc.type	Thesis
dc.description.degree	M.Eng.
dc.description.degree	S.B.
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
mit.thesis.degree	Master
mit.thesis.degree	Bachelor
thesis.degree.name	Master of Engineering in Electrical Engineering and Computer Science
thesis.degree.name	Bachelor of Science in Computer Science and Engineering

Files in this item

Name:: Lin-gzlin-meng-eecs-2022-thesis.pdf
Size:: 3.414Mb
Format:: PDF
Description:: Thesis PDF

View/Open

This item appears in the following Collection(s)

Graduate Theses

Show simple item record